Preface



Cantor's theory of cardinality violates common sense. It says, for example, that 
all infinite sets of integers are the same size. This thesis criticizes the arguments 
for Cantor's theory and presents an alternative. 

The alternative is based on a general theory, CS (for Class Size). CS consists 
of all sentences in the first order language with a subset predicate and a less-than 
predicate which are true in all interpretations of that language whose domain is 
a finite power set. Thus, CS says that less than is a linear ordering with highest 
and lowest members and that every set is larger than any of its proper subsets. 
Because the language of CS is so restricted, CS will have infinite interpretations. 
In particular, the notion of one-one correspondence cannot be expressed in this 
language, so Cantor's definition of similarity will not be in CS, even though it 
is true for all finite sets. 

We show that CS is decidable but not finitely axiomatizable by characterizing 
the complete extensions of CS. CS has finite completions, which are true only
in finite models, and infinite completions, which are true only in infinite models.
An infinite completion is determined by a set of remainder principles, which 
say, for each natural number, n, how many atoms remain when the universe is 
partitioned into n disjoint subsets of the same size. 

We show that any infinite completion of CS has a model over the power set 
of the natural numbers which satisfies an additional axiom: 

OUTPACING. If initial segments of A eventually become smaller 
than the corresponding initial segments of B, then A is smaller than 
B. 

Models which satisfy OUTPACING seem to accord with common intuitions 
about set size. In particular, they agree with the ordering suggested by the 
notion of asymptotic density. 

Introduction



1.1 The Problem 



This paper proposes a theory of set size which is based on intuitions, naive and 
otherwise. The theory goes beyond intuitions, as theories will, so it needs both 
justification and defense. I spend very little time justifying the theory; it is so 
clearly true that anyone who comes to the matter without prejudice will accept 
it. I spend a lot of time defending the theory because no one who comes to the 
matter comes without prejudice. 

The prejudice stems from Cantor's theory of set size, which is as old as sets 
themselves and so widely held as to be worthy of the name the standard theory. 
Cantor's theory consists of just two principles: 

ONE-ONE. Two sets are the same size just in case there is a one-one 
correspondence between them. 

CANTOR<. A set, x, is smaller than a set, y, just in case x is the
same size as some subset of y, but not the same size as y itself. 

A one-one correspondence between two sets is a relation which pairs each 
member of either set with exactly one member of the other. For example, the 
upper-case letters of the alphabet can be paired with the lower-case letters:

$A \leftrightarrow a,\; B \leftrightarrow b,\; \ldots,\; Z \leftrightarrow z$


So, the standard theory says, the set of upper-case letters is the same size 
as the set of lower-case letters. Fine and good. 

The standard theory also says that the set of even numbers is the same size 
as the set of integers since these two sets can also be paired off one-to-one:
each integer n is paired with the even number 2n.

Similarly, the standard theory says that the set of positive even integers is
the same size as the set of prime numbers: pair the n-th prime with the n-th 
positive even number. In both of these cases, common sense chokes on the 
standard theory. 
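
The two pairings just described can be written out for finite fragments of the sets involved. The following Python sketch is only an illustration: the function names (`pair_integers_with_evens`, `first_primes`, `pair_primes_with_evens`) are hypothetical, and a finite fragment can of course only suggest, not establish, the infinite correspondence.

```python
from itertools import count


def pair_integers_with_evens(integers):
    """Pair each integer n with the even integer 2n."""
    return [(n, 2 * n) for n in integers]


def first_primes(k):
    """Return the first k primes by trial division (adequate for small k)."""
    primes = []
    for candidate in count(2):
        if len(primes) == k:
            break
        # candidate is prime iff no smaller prime divides it
        if all(candidate % p != 0 for p in primes):
            primes.append(candidate)
    return primes


def pair_primes_with_evens(k):
    """Pair the n-th prime with the n-th positive even number, n = 1..k."""
    evens = [2 * n for n in range(1, k + 1)]
    return list(zip(first_primes(k), evens))


# Each element on either side occurs in exactly one pair, so every
# finite fragment of the pairing is one-one.
print(pair_integers_with_evens(range(-2, 3)))
print(pair_primes_with_evens(5))
```

Nothing in the sketch bears on whether such a pairing settles the question of size; it only makes the standard theory's claim concrete.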

In the first case, common sense holds that the set of integers is larger than 
the set of even integers. The integers contain all of the even integers and then 
some. So it's just good common sense to believe there are more of the former 
than the latter. This is just to say that common sense seems to follow: 

SUBSET. If one set properly includes another, then the first is larger 
than the second. 

even into the infinite, where it comes up against the standard theory. 

Common sense can make decisions without help from SUBSET. Though 
the set of primes is not contained in the set of even integers, it is still clear to 
common sense that the former is smaller than the latter. One out of every two 
integers is even, while the prime numbers are few and far between. No doubt, to 
use this reasoning, you need a little number theory in addition to common sense; 
but, given the number theory, it's the only conclusion common sense allows. 
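
The "little number theory" appealed to here can be illustrated by counting. The sketch below, with hypothetical helper names `is_prime` and `counts_up_to`, counts the even numbers and the primes in a few initial segments of the positive integers; it assumes nothing beyond trial division.

```python
def is_prime(n):
    """Trial-division primality test, adequate for small n."""
    if n < 2:
        return False
    i = 2
    while i * i <= n:
        if n % i == 0:
            return False
        i += 1
    return True


def counts_up_to(limit):
    """Return (number of evens, number of primes) among 1..limit."""
    evens = sum(1 for n in range(1, limit + 1) if n % 2 == 0)
    primes = sum(1 for n in range(1, limit + 1) if is_prime(n))
    return evens, primes


for limit in (10, 100, 1000):
    evens, primes = counts_up_to(limit)
    print(f"up to {limit}: {evens} evens, {primes} primes")
```

In each of these segments the primes are outnumbered by the evens, and the gap widens as the segment grows; this is the pattern that common sense extrapolates to the whole sets.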

The theory proposed here accommodates these bits of common sense
reasoning. It maintains SUBSET and a few and far between principle and much
else besides. To state this theory, we use three two-place predicates: <, ~, and 
>. If A and B name sets, then 

• $\ulcorner A < B \urcorner$ is read as A is smaller than B,

• $\ulcorner A \sim B \urcorner$ is read as A is the same size as B, and

• $\ulcorner A > B \urcorner$ is read as A is larger than B.

Incidentally, we assume throughout this thesis that the following schemata 
are equivalent, item by item, to the readings of the three predicates given above, 
assuming that A is the set of α's and B is the set of β's:

a. • There are fewer α's than β's.

• There are just as many α's as β's.

• There are more α's than β's.

b. • The number of α's is less than the number of β's.

• The number of α's is the same as the number of β's.

• The number of α's is greater than the number of β's.

c. • The size of A is smaller than the size of B. 

• A and B are (or, have) the same size. 

• The size of A is larger than the size of B. 

Regarding this last group, we emphasize that we are not arguing that there
really are such things as set sizes, nor that there are not really such things. 
Statements about sizes can be translated in familiar and long-winded ways into 
statements about sets, though we will not bother to do so. 

I have identified the standard theory, Cantor's, with two principles about set 
size. The term size, however, is rarely used in connection with Cantor's theory; 
so it might be wondered whether the standard theory is really so standard. In 
stating ONE-ONE and CANTOR<, Cantor used the terms power and cardinal
number rather than size. In the literature, the term cardinal number (sometimes 
just number) is used most frequently. If someone introduces cardinal number 
as a defined predicate or as part of a contextual definition (e.g. "We say that 
two sets have the same cardinal number just in case ... "), there is no point in 
discussing whether that person is right about size. 

Though Cantor's theory is usually taken as a theory of set size, it can also 
be taken as just a theory of one-one correspondences. More specifically, saying 
that two sets are similar iff they are in one-one correspondence can either be 
taken as a claim about size or be regarded as a mere definition. Whether or 
not similarity is coextensive with being the same size, the definition is worth 
making. The relation picked out is well studied and well worth the study. The 
technical brilliance of the theory attests to this: it has given us the transfinite 
hierarchy, the continuum problem, and much else. In addition, the theory has 
consequences which do not prima facie seem to have anything to do with size 
or similarity: the existence of transcendental numbers comes to mind. All of 
this is to say that the interest in one-one correspondence has not been sustained 
solely by its identification with the notion of size. Hence, denying that they are 
the same does not endanger the theory of one-one correspondences, per se. 

But most mathematicians and philosophers do not use cardinal number as 
a mere abbreviation. They use the term in just the way that we use size and 
slide freely among (a), (b), and (c). This is true, in particular, of Cantor, who
offered ONE-ONE as a theory; indeed, he offered an argument for this theory.



1.2 Cantor's argument 

Cantor bases his argument for ONE-ONE on the idea that the size of a set, its
cardinal number, depends on neither the particular elements it contains nor on 
how those elements are arranged: 

The cardinal number, |M|, of a set M (is the general concept which)
arises from M when we make abstraction of the nature of its elements
and of the order in which they are given. [Cantor, p



But to say that the cardinal number of a set does not depend on certain things
is not to say what the cardinal is. Neither does it ensure that two sets have
the same cardinal number just in case they are in one-one correspondence. To
flesh out this notion of double abstraction, Cantor reduces it to a second
abstraction operator, one which works on the elements of sets rather than the sets
themselves:

Every element, m, if we abstract from its nature becomes a unit.
[Cantor, p



and so, concludes Cantor: 

The cardinal number, |M|, is a set composed of units (which has
existence in our minds as an intellectual image or projection of M.)
[Cantor, p



According to Jourdain, Cantor 

distinguished very sharply between an aggregate and a cardinal number
that belongs to it: "Is not an aggregate an object outside us,
whereas its cardinal number is an abstract picture of it in our mind."
[Cantor, p. 80]



I have parenthesized the expressions above where Cantor describes cardinal 
numbers as mental entities. Nevertheless, I can only make sense of his arguments 
insofar as he treats cardinal numbers as sets: he refers to them as 'definite 
aggregates', supposes that they have elements, and employs mappings between 
cardinal numbers and other sets. 

The following three statements seem to express Cantor's intent: 



(1.1) $|M| = \{\, y \mid \exists x\,(x \in M \land y = \mathrm{Abstract}(x)) \,\}$

(1.2) $\forall x\,\exists y\,(y = \mathrm{Abstract}(x) \land \mathrm{Unit}(y))$

(1.3) $\forall M\,\forall y\,(y \in |M| \rightarrow \mathrm{Unit}(y))$

Abstract(x) is to be read as the result of abstracting from the element x,
Unit(x) as x is a unit, and |M| as the cardinal number of M.

So (1.1) gives a definition of cardinal number, in terms of the operation of 
abstraction, from which Cantor proves both ONE-ONEa and ONE-ONEb. 

(ONE-ONEa) $M \sim N \rightarrow |M| = |N|$

(ONE-ONEb) $|M| = |N| \rightarrow M \sim N$

ONE-ONEa is true, says Cantor, because 

the cardinal number |M| remains unaltered if in the place of one
or many or even all elements m of M other things are substituted.
[Cantor, p. 80]

and so, if f is a one-one mapping from M onto N, then in replacing each element,
m, of M with f(m)

M transforms into N without change of cardinal number. (p. 88)

In its weakest form, the principle Cantor cites says that if a single element of
M is replaced by an arbitrary element not in M then the cardinal number of
the set will remain the same. That is,

(1.4) $(a \in M \land b \notin M) \land N = \{\, x \mid (x \in M \land x \neq a) \lor x = b \,\} \rightarrow |M| = |N|$

The reasoning is clear: so far as the cardinal number of a set is concerned, one 
element is much the same as another. It is not the elements of a set, but only 
their abstractions, that enter into the cardinal number of a set. But abstractions 
of elements are just units; so one is much the same as another. 
ONE-ONEb is true, Cantor says, because

... |M| grows, so to speak, out of M in such a way that from every
element m of M a special unit of |M| arises. Thus we can say that
M ~ |M|.

So, since a set is similar to its cardinal number, and similarity is an equivalence 
relation, two sets with the same cardinal number are similar. Unless each element
of a set abstracts to a 'special', i.e. distinct, unit, the correspondence from M 
to its cardinal number will be many-one and not one-one. A weak version of 
this principle is: 

(1.5) $M = \{a, b\} \land a \neq b \rightarrow |M| = \{\mathrm{Abstract}(a), \mathrm{Abstract}(b)\} \land \mathrm{Abstract}(a) \neq \mathrm{Abstract}(b)$

These two arguments do one another in. (1.4) says that replacing an element
of a set with any element not in the set does not affect the cardinality. But, by
the definition of |M|, (1.1), this means that

(1.6) $\forall x\,\forall y\,(\mathrm{Abstract}(x) = \mathrm{Abstract}(y))$

For, consider an arbitrary pair of elements, a and b. Let M = {a} and let
N = {b}. So, the conditions of (1.4) are met and |M| = |N|. But |M| =
{Abstract(a)} and |N| = {Abstract(b)}, by (1.1). So Abstract(a) = Abstract(b).
Generalizing this argument yields (1.6).

So Cantor's argument for ONE-ONEa only works by assigning all
non-empty sets the same, one-membered, cardinal number. But this contradicts
ONE-ONEb.

Conversely, the argument that a set is similar to its cardinal number relies
on (1.5), which entails

(1.7) $\forall x\,\forall y\,(x \neq y \rightarrow \mathrm{Abstract}(x) \neq \mathrm{Abstract}(y))$

assuming only that any two objects can constitute a set. But if the abstractions
of any two elements are distinct, then no two sets have the same cardinal number
as defined by (1.1), contra ONE-ONEa.

There is no way to repair Cantor's argument. Rather than leading to a 
justification of ONE-ONE, Cantor's definition of cardinal number is sufficient 
to refute the principle. The negation of (1.6) is: 

(1.8) $\exists x\,\exists y\,(\mathrm{Abstract}(x) \neq \mathrm{Abstract}(y))$

So one of (1.6) and (1.8) must be true. We have shown that (1.6) contradicts
ONE-ONEb. Similarly, (1.8) contradicts ONE-ONEa; if a and b have distinct
abstractions, then {a} and {b} have distinct cardinal numbers, {Abstract(a)}
and {Abstract(b)}, despite the fact that they are in one-one correspondence. So
ONE-ONE is false whether (1.6) or its negation, (1.8), is true.
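
The dilemma can be made concrete with a small Python sketch. It rests on an assumption that is mine rather than Cantor's: Abstract is modeled as an ordinary function, and the cardinal number of M is computed exactly as in (1.1). The names `cardinal`, `constant_abstract`, and `distinct_abstract` are hypothetical.

```python
def cardinal(M, abstract):
    """Cardinal number of M per (1.1): the set of abstractions of its members."""
    return frozenset(abstract(x) for x in M)


def constant_abstract(x):
    """One way of satisfying (1.6): every element abstracts to the same unit."""
    return "unit"


def distinct_abstract(x):
    """One way of satisfying (1.8): distinct elements keep distinct abstractions."""
    return ("unit", x)


# Under (1.6), {a} and {a, b} receive the same cardinal number although
# they are not in one-one correspondence, so ONE-ONEb fails.
oneone_b_fails = (cardinal({"a"}, constant_abstract)
                  == cardinal({"a", "b"}, constant_abstract))

# Under (1.8), {a} and {b} are in one-one correspondence but receive
# different cardinal numbers, so ONE-ONEa fails.
oneone_a_fails = (cardinal({"a"}, distinct_abstract)
                  != cardinal({"b"}, distinct_abstract))

print(oneone_b_fails, oneone_a_fails)
```

The two functions are merely one representative of each horn; any function satisfying (1.6), or any function satisfying (1.8), would serve.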



1.3 Cantor and the logicists 

Though both Frege and Russell accepted Cantor's theory of cardinality, neither 
accepted Cantor's argument. Frege spends an entire chapter of the Grundlagen 
mocking mathematicians from Euclid to Schröder for defining numbers as sets
of units. He neatly summarizes the difficulty with such views: 

If we try to produce the number by putting together different
distinct objects, the result is an agglomeration in which the objects
remain still in possession of precisely those properties which serve to
distinguish them from one another, and that is not the number. But
if we try to do it in the other way, by putting together identicals,
the result runs perpetually together into one and we never reach a
plurality . . .

The word 'unit' is admirably adapted to conceal the difficulty . . . We
start by calling the things to be numbered 'units' without detracting
from their diversity; then subsequently the concept of putting
together (or collecting, or uniting, or annexing, or whatever we choose
to call it) transforms itself into arithmetical addition, while the
concept word 'unit' changes unperceived into the proper name 'one'.
[Frege, pp. 50-51]



These misgivings about units do not prevent Frege from basing his definition
of 'number' and his entire reduction of arithmetic on Cantor's notion of
one-one correspondence. "This opinion", says Frege, "that numerical equality or
identity must be defined in terms of one-one correlation, seems in recent years to
have gained widespread acceptance among mathematicians" [Frege, pp. 73-74].
Frege cites Schröder, Kossak, and Cantor.



Russell displays similar caution about Cantor's argument [Russell, p. 305] and
similar enthusiasm for his theory (see the quote at the beginning of Chapter 2,
for example).



Of course, Frege and Russell cleaned up Cantor's presentation of the theory.
Russell, for example, notes that Cantor's statement (1) is not a 'true definition' 
and 

merely presupposes that every collection has some such property as 
that indicated — a property, that is to say, independent of its terms 
and their order; depending, we might feel tempted to add, only upon
their number. [Russell, pp. 304-305]



So Russell, and similarly Frege, relied upon the principle of abstraction to obtain
a 'formal definition' of cardinal numbers, in contrast to Cantor, who had "taken"
number "to be a primitive idea" and had to rely on "the primitive proposition
that every collection has a number." [Russell, p. 305]



So, while some people regard Cantor's ONE-ONE as just a definition and 
others embrace it as a theory, the logicists have it both ways: adding ONE-ONE 
as a formal definition to set theory (or, as they would call it, logic) they have no 
obligation to defend it and can steer clear of peculiar arguments about units; at 
the same time, they can advance it as a great lesson for simple common sense. 

The logicists' adoption of Cantor's theory of cardinality needs no great
explanation: it came with set theory and, to a large extent, motivated set theory
and determined its research problems. But there are two specific reasons that
they should have seized upon ONE-ONE and CANTOR<. First, they both
have the form of definitions, no matter how they are intended. So the notion of
cardinality is born reduced.

Second, Cantor's theory clears the way for other reductions. Suppose, for 
example, you wish to reduce ordered pairs to sets. Well, you have to identify 
each ordered pair with a set and define the relevant properties and relations 
among ordered pairs in terms of properties and relations among sets. One 
of the relations that has to be maintained is identity; so each ordered pair 
must be identified with a distinct set. In addition, the relative sizes of sets of 
ordered pairs should be preserved under translation. But, if ONE-ONE is the 
correct theory of size, then this second condition follows from the first, since the 
existence of one-one correspondences will be preserved under a one-one mapping. 



1.4 Aims and outline 

It would be naive to suppose that people's faith in Cantor's theory would be 
shaken either by refuting specific arguments for ONE-ONE or by associating the 
acceptance of that theory with a discredited philosophy of mathematics. Such 
points may be interesting, but in the absence of an alternative theory of size 
they are less than convincing. 

This dissertation presents such an alternative. Chapter 2 canvasses common
sense intuitions for some basic principles about set size. Chapter 3 reorganizes
those principles into a tidy set of axioms, offers an account of where the
intuitions come from (viz. known facts about finite sets), and mines this source for
additional principles. Chapters 4 and 5 prove that the theory so obtained is
complete, in the sense that it embraces all facts about finite sets of a certain
kind (i.e. expressible in a particular language). Finally, Chapter 6 elaborates
additional principles that concern only sets of natural numbers and
demonstrates that these additional principles, together with the theory in Chapter 3,
are satisfiable in the domain of sets of natural numbers.



2 The General Theory



The possibility that whole and part may have the same number 
of terms is, it must be confessed, shocking to common sense . . . 
Common sense, therefore, is here in a very sorry plight; it must 
choose between the paradox of Zeno and the paradox of Cantor. I 
do not propose to help it, since I consider that, in the face of the
proofs, it ought to commit suicide in despair. [Russell, p. 358]



Is common sense confused about set size, as Russell says, or is there a way 
of elaborating on common sense to get a plausible and reasonably adequate 
theory of cardinality? To be plausible, a theory should at least avoid principles 
and consequences which violate common sense. To be reasonably adequate, a 
theory has to go beyond bare intuitions: it should not rest with trivialities and 
it should answer as many questions about set size as possible, though it need 
not be complete. Plausibility and adequacy are conflicting demands: the first 
says that there should not be too many principles (no false ones, consistency), 
the second that there should not be too few principles. 

In the Introduction, I argued that a coherent theory of cardinality has to 
contain some principles that refer to the kinds of objects in sets, pace Cantor. In 
this chapter, however, I want to see how far we can go without such principles; 
i.e. how much can you say about smaller than without using predicates (other 
than identity) which relate the members of the sets being compared? I shall 
begin by stating a number of principles and explaining why they are included 
in the general theory. 



2.1 The Theory CORE 

First, there is the SUBSET principle: 

SUBSET. If x is a proper subset of y, then x is smaller than y.

The reason for including SUBSET should be obvious. What has prompted the 
search for an alternative to the standard theory of cardinality is the conflict 
between ONE-ONE and SUBSET. Now, it is often said that common sense
supports both of these principles and that it is in doing so that common sense 
is confused. From this it is supposed to follow that common sense cannot be 
relied upon, so we should opt for ONE-ONE, with the technical attractions that 
it provides. 

But, there's a difference in the way that common sense supports these two 
principles. There is no doubt that you can lead an unsuspecting person to 
agree to ONE-ONE by focusing their attention on forks and knives, husbands 
and wives, and so forth: i.e. finite sets. With carefully chosen examples, say 
the odds and the evens, you might even convince someone that ONE-ONE is 
true for infinite sets, too. Now, I do not think that such guile is needed to 
lead someone to agree to SUBSET, but that's not what my argument depends 
on. The argument hinges on a suggestion about how to resolve cases where 
mathematical intuitions seem to conflict. The suggestion is to see what happens 
with particular cases on which the principles conflict before you've led someone
to agree to either of the general statements. 

So, if you want to find out what common sense really thinks about SUBSET 
and ONE-ONE, you would present people with pairs of infinite sets, where one 
was a proper subset of the other. I've actually tried this, in an unscientific way, 
and what I've gotten, by and large, is what I expected: support for SUBSET. 
(By and large because many people think that all infinite sets have the same 
size: Infinity.) 

Naturally, I would not venture that this sort of technique, asking people, is 
any way to find out which of SUBSET and ONE-ONE is true. People's intu- 
itions about mathematics are notoriously unreliable, not to mention inconstant. 
Of course, harping on this fact might engender some unwarranted skepticism 
about mathematics. What I am suggesting is that there might be a rational 
way of studying mathematical intuitions and that we should at least explore 
this possibility before proclaiming common sense to be hopelessly confused on 
mathematical matters. 

So, SUBSET, all by itself seems to be a plausible alternative to Cantor's 
theory, though it surely is not enough. Given just this principle, it's possible 
that one set is smaller than another just in case it is a proper subset of the 
other. It's clear that we need additional principles. All of the following seem 
worthy (where x < y is to be read as x is smaller than y, x ~ y is to be read as
x is the same size as y, and x > y as x is larger than y; the Appendix gives a full
account of the notations used throughout this thesis).



Theory 2.1.1. QUASI-LOGICAL

(ASYM<) $x < y \rightarrow \neg(y < x)$
(ASYM>) $x > y \rightarrow \neg(y > x)$
(TRANS<) $(x < y \land y < z) \rightarrow x < z$
(TRANS>) $(x > y \land y > z) \rightarrow x > z$
(INDISC~) $x \sim y \rightarrow \mathrm{Indisc}(x, y)$
(REF~) $x \sim x$
(SYM~) $x \sim y \rightarrow y \sim x$
(TRANS~) $(x \sim y \land y \sim z) \rightarrow x \sim z$
(DEF>) $x > y \leftrightarrow y < x$

where Indisc(x, y) abbreviates $\forall z\,((z < x \leftrightarrow z < y) \land (z > x \leftrightarrow z > y))$

We call the principles listed above quasi-logical principles because it is 
tempting to defend them as logical truths. Consider the first principle, for 
example, in unregimented English: 

ASYM<. If x is smaller than y, then y is not smaller than x.

This sentence can be regarded as an instance of the schema:

ASYM_F. If x is F-er than y, then y is not F-er than x.

where F is to be replaced by an adjective from which comparatives can be 
formed, e.g. 'tall', 'short', 'happy', but not 'unique' or 'brick'. It appears that 
every instance of this schema is true, so it could be maintained that each is true 
in virtue of its form, that each is a logical truth. 

The other principles might be defended in the same way, though the schema
for INDISC~ would have to be restricted to triples of corresponding
comparatives, for example: is smaller than, is larger than, is the same size as.

But using such observations to support these principles would be problematic 
for two reasons. First, it would require taking positions on many questions 
about logical form and grammatical form which would take us far afield and, 
possibly, antagonize first-order logicians. Second, there are some instances of 
the schemata that make for embarrassing counterexamples: 'further east than'
(in a round world) and 'earlier than' (in, I'm told, a possible world). 

So, it might be that casting the principles above as instances of the
appropriate schema would only explain why they are part of common sense. What
remains clear is that a theory of cardinality which openly denied any of these 
principles would be implausible: it would be ridiculed by common sense and 
mathematical sophisticates alike. I can just barely imagine presenting a theory 
which, for fear of inconsistency, withheld judgment on one or more of these 
principles. But to do so without good reason would be counterproductive. It 
seems that if a case could be made that these statements, taken together, are 
inconsistent with SUBSET, that would be good reason to say that there is no 
reasonably adequate alternative to Cantor's theory. Since my goal is to counter
such a conclusion, it seems that the proper strategy is to include such seemingly
obvious truths and to show that the resulting theory is consistent. So the
strategy here is not to adduce principles and argue for the truth of each. This would
be impossible, given that the principles are logically contingent. Instead, our 
approach is to canonize what common sense holds to be true about cardinality 
and show that the result is consistent and reasonably adequate. 

As long as we restrict ourselves to SUBSET and the quasi-logical principles, 
consistency is no problem. After all, what do the quasi-logical principles say? 
Only that smaller than is a partial ordering, that larger than is the converse
partial ordering, that the same size as is an equivalence relation, and that 
sets of the same size are indiscernible under the partial orderings. So, if we 
are given any domain of sets, finite or infinite, we get a model for our theory 
by assigning to < the relation of being a proper subset of, assigning to > the 
relation of properly including, and assigning to ~ the identity relation. Since 
common sense knows that different sets can be the same size, there must be 
some additional principles to be extracted from common sense. 
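
The model just described can be checked mechanically on a small domain. The following Python sketch is an illustration under assumptions of my own: the domain is the power set of a three-element set, and the names `power_set` and `check_principles` are hypothetical. It confirms that the proper-subset interpretation satisfies SUBSET and the quasi-logical principles, and it also shows that this interpretation leaves many pairs of sets incomparable.

```python
from itertools import combinations, product


def power_set(base):
    """All subsets of base, as frozensets."""
    items = sorted(base)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def check_principles(domain, smaller, same):
    """Evaluate SUBSET, the quasi-logical principles, and comparability."""
    larger = lambda x, y: smaller(y, x)  # DEF>

    def indisc(x, y):
        return all(smaller(z, x) == smaller(z, y) and
                   larger(z, x) == larger(z, y) for z in domain)

    pairs = list(product(domain, repeat=2))
    triples = list(product(domain, repeat=3))
    return {
        # frozenset "<" is the proper-subset test
        "SUBSET": all(smaller(x, y) for x, y in pairs if x < y),
        "ASYM<": all(not smaller(y, x) for x, y in pairs if smaller(x, y)),
        "TRANS<": all(smaller(x, z) for x, y, z in triples
                      if smaller(x, y) and smaller(y, z)),
        "REF~": all(same(x, x) for x in domain),
        "SYM~": all(same(y, x) for x, y in pairs if same(x, y)),
        "TRANS~": all(same(x, z) for x, y, z in triples
                      if same(x, y) and same(y, z)),
        "INDISC~": all(indisc(x, y) for x, y in pairs if same(x, y)),
        "comparable": all(smaller(x, y) or same(x, y) or larger(x, y)
                          for x, y in pairs),
    }


results = check_principles(
    power_set({0, 1, 2}),
    smaller=lambda x, y: x < y,  # smaller than := proper subset of
    same=lambda x, y: x == y,    # same size as := identity
)
print(results)
```

On this domain every principle holds, while the comparability check fails: {0} and {1}, for example, are neither smaller, larger, nor the same size. The ASYM> and TRANS> checks are omitted because, given DEF>, they mirror the checks for <.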

We shall now consider some principles which cannot be regarded as
quasi-logical.

First, there is the principle of trichotomy.

(TRICH) $x < y \lor x \sim y \lor x > y$

which says that any two sets are comparable in size. While a theory of set size
which excluded TRICH might escape ridicule, it would surely be regarded with
suspicion. Indeed, if the principles of common sense were incompatible with
TRICH, this would undoubtedly be used to discredit them.

Second, there is the representation principle.

(REP<) $x < y \rightarrow \exists x'\,(x' \sim x \land x' \subset y)$

which says that if a set, x, is smaller than another, y, then x is the same size
as some proper subset of y. Now, this is a principle which common sense has
no particular feelings about. Analogous statements about physical objects are
neither intuitive nor very clearly true. For example, (1) does not stand a chance
of being regarded as true:

(1) If one table is smaller than another, then the first is the same
size as some proper part of the second.

if 'part' is taken to mean 'leg or top or rim or ...'. Even if common sense can
be persuaded to take particles and arbitrary fusions of such as parts of tables,
no one should condemn its residual caution about (1). If REP< is true, then it
seems to be an interesting and special fact about sets.

REP< was originally included in this theory for technical reasons; it makes
it easier to reduce the set of axioms already presented and it provides a basis for
several principles not yet presented. REP< may be open to doubt, but it is not
a principle that Cantorians could complain about, for it is entailed by Cantor's
definition of <:

(CANTOR<) $x < y \leftrightarrow \neg(x \sim y) \land \exists x'\,(x' \sim x \land x' \subset y)$

If CANTOR< is regarded as a principle instead of a definition, then it is entailed
by the principles we have already mentioned:

If $x < y$, then $\neg(x \sim y)$, by INDISC~ and ASYM<. By REP<,
some proper subset of y, say x', must be the same size as x. But
$x' < y$, by SUBSET; so $x < y$, by INDISC~.

Conversely, if $x' \sim x$ and $x' \subset y$, then $x' < y$, by SUBSET. So
$x < y$, by INDISC~.

There are more principles to come, but before proceeding, I'd like to take stock 
of what we already have. First, I want to reduce the principles mentioned above 
to a tidy set of axioms. Second, I want to estimate how far we've gone. 

The entire set of principles already adopted is equivalent to the following,
which will be referred to as the core theory. 

Theory 2.1.2. CORE, the core theory,

(SUBSET) $x \subset y \rightarrow x < y$
(DEF~) $x \sim y \leftrightarrow \mathrm{Indisc}(x, y)$
(TRANS~) $(x \sim y \land y \sim z) \rightarrow x \sim z$
(DEF>) $x > y \leftrightarrow y < x$
(IRREF<) $\neg(x < x)$
(TRICH) $x < y \lor x \sim y \lor x > y$

The only axiom in CORE that has not already been introduced is DEF~,
which is logically equivalent to the conjunction of INDISC~ and its converse
~INDISC:

(INDISC~) $x \sim y \rightarrow \mathrm{Indisc}(x, y)$
(~INDISC) $\mathrm{Indisc}(x, y) \rightarrow x \sim y$

~INDISC says that if two sets fail to be the same size, then their being different
in size is attributable to the existence of some set which is either smaller than
one but not smaller than the other, or larger than one but not larger than the
other.

Theorem 2.1.3. Let T = QUASI; SUBSET; REP<. Then $T \equiv \mathrm{CORE}$.
Proof.

$T \vdash \mathrm{CORE}$. We only need to show that ~INDISC is entailed by T.
Suppose $\neg(x \sim y)$. So $x < y$ or $y < x$, by TRICH. But $\neg(x < x)$ and
$\neg(y < y)$, by IRREF<. So x (in the first case) or y (in the second) is
smaller than one of the two sets but not the other, and Indisc(x, y) fails.

$\mathrm{CORE} \vdash T$.

(i) TRANS<. If $y < z$, there is a y' such that $y' \sim y$ and $y' \subset z$ by
REP<. If $x < y$, then $x < y'$ because $y' \sim y$, by DEF~. So, there
is an x' such that $x' \sim x$ and $x' \subset y'$. So $x' \subset z$ and, by SUBSET,
$x' < z$. But then $x < z$ by DEF~.

(ii) ASYM<. If $x < y$ and $y < x$, then $x < x$, by TRANS<, contra
IRREF<.

(iii) TRANS>, ASYM>, and IRREF> follow from the corresponding
principles for < and DEF>.

(iv) INDISC~, SYM~, TRANS~, and REF~ are logical consequences of
DEF~.

□

CORE is consistent. In fact, two kinds of models satisfy CORE. 
Definition 2.1.4. 

a. A is a finite class model with basis x iff

(i) $A = \mathcal{P}(x)$, where x is finite.
(ii) $A \models a \subset b$ iff $a \subset b$
(iii) $A \models a < b$ iff $|a| < |b|$

b. A is a finite set model iff

(i) $A = \{\, x \mid x \text{ is a finite subset of } Y \,\}$, for some infinite Y.
(ii) $A \models a \subset b$ iff $a \subset b$
(iii) $A \models a < b$ iff $|a| < |b|$

Models can be specified by stipulating the smaller than relation since larger
than and same size as are defined in terms of smaller than.

Fact 2.1.5. 

a. If A is a finite class model, then A \= CORE. 

b. If A is a finite set model, then A \= CORE. 

Proof. In both cases, the finite cardinalities determine a quasi-linear ordering 
of the sets in which any set is higher than any of its proper subsets. □ 

The normal ordering of finite sets is, in fact, the only one that satisfies 
CORE. By adding TRICH we have ruled out all non-standard interpretations 
of <. 

Theorem 2.1.6. Suppose that A is a model such that 
a. If $x \in A$, then x is finite, 

b. If $x \in A$ and $y \subset x$, then $y \in A$, and 

c. $A \models$ CORE. 

Then $A \models a < b$ iff $|a| < |b|$. 

Proof. We shall prove (*) by induction on n: 

(*) If $|a| = n$, then $A \models (a \sim b)$ iff $|b| = n$. 

Suppose $n = 0$; then $a = \emptyset$. 
But if $|b| = 0$, then $A \models (a \sim b)$, by REF~. 
And if $|b| \neq 0$, then $a \subset b$. 
So $A \models (a < b)$, by SUBSET. 
So $A \not\models (a \sim b)$, by INDISC~. 

Now, suppose that (*) is true for all $i \leq n$. 

If $|a| = |b| = n + 1$, then $A \models (a \sim b)$, as follows: 
Suppose $A \models (a < b)$. 
So $A \models (a' \sim a)$ for some $a' \subset b$, by REP<. 
But if $A \models a' \subset b$, then $|a'| \leq n$. 
So $|a| \leq n$, by (*), contra our hypothesis. 

But if $|a| = n + 1$ and $A \models (a \sim b)$, then $|b| \geq n + 1$, by induction. 
But if $|b| > n + 1$, pick $b' \subset b$, $b' \in A$, with $|b'| = n + 1$, by condition (b). 
So $A \models (b' < b)$, by SUBSET, 
and $A \models (b' \sim a)$. 
So $A \models (a < b)$, contra our supposition. 
So $|b| = n + 1$. 

□ 
2.2 Addition of Set Sizes 

We shall now extend CORE to an account of addition of set sizes. Since the 
domains of our intended models contain only sets and not sizes of sets, we have 
to formulate our principles in terms of a three-place predicate true of triples of 
sets: Sum(x, y, z) is to be read as the size of z is the sum of the sizes of x and 
y. 

The following principles are sufficient for a theory of addition: 

Theory 2.2.1. ADDITION 

Functionality of addition (FUNC+) 

(a) $\mathrm{Sum}(x, y, z) \rightarrow (x \sim x' \leftrightarrow \mathrm{Sum}(x', y, z))$ 

(b) $\mathrm{Sum}(x, y, z) \rightarrow (y \sim y' \leftrightarrow \mathrm{Sum}(x, y', z))$ 

(c) $\mathrm{Sum}(x, y, z) \rightarrow (z \sim z' \leftrightarrow \mathrm{Sum}(x, y, z'))$ 

Addition for disjoint sets 

(DISJ+) $x \cap y = \emptyset \rightarrow \mathrm{Sum}(x, y, x \cup y)$ 

Monotonicity of addition 

(MONOT) $\mathrm{Sum}(x, y, z) \rightarrow x < z \vee x \sim z$ 

FUNC+ says that sets bear Sum relations to one another by virtue of their 
sizes alone. This condition must clearly be met if Sum is to be read as specified 
above. 

DISJ+ tries to say what function on sizes the Sum relation captures by fixing 
the function on paradigm cases: disjoint sets. But FUNC+ and DISJ+ leave 
open the possibility that addition is cyclic: suppose we begin with a finite class 
model whose basis has n elements and assign to Sum those triples (x, y, z) where 

$|z| = (|x| + |y|) \bmod (n + 1)$ 

Both FUNC+ and DISJ+ will be satisfied, though the interpretation of Sum does 
not agree with the intended reading. MONOT rules out such interpretations. 

Given an interpretation of < over a power set there is at most one way of 
interpreting Sum which satisfies ADDITION. We shall show this by proving 
that ADDITION and CORE entail DEF+: 

(DEF+) $\mathrm{Sum}(x, y, z) \leftrightarrow \exists x' \exists y' (x \sim x' \wedge y \sim y' \wedge x' \cap y' = \emptyset \wedge x' \cup y' = z)$ 

DEF+ says that the extension of Sum is determined by the extension of <. 
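The cyclic interpretation described above can be checked by brute force on a small finite class model. The following is a minimal sketch, not part of the original text: the helper names (`powerset`, `cyclic_sum`, `check_cyclic`) are hypothetical, and sizes are read off as cardinalities, as in a finite class model.

```python
from itertools import combinations, product

def powerset(basis):
    """All subsets of `basis`, as frozensets."""
    items = sorted(basis)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def cyclic_sum(n):
    """Sum(x, y, z) holds iff |z| == (|x| + |y|) mod (n + 1)."""
    return lambda x, y, z: len(z) == (len(x) + len(y)) % (n + 1)

def check_cyclic(n):
    """Return (FUNC+ holds, DISJ+ holds, MONOT holds) for an n-element basis."""
    sets = powerset(range(n))
    Sum = cyclic_sum(n)
    same = lambda a, b: len(a) == len(b)   # a ~ b
    less = lambda a, b: len(a) < len(b)    # a < b
    triples = [t for t in product(sets, repeat=3) if Sum(*t)]
    # FUNC+ (a), (b), (c): replacing one argument by w preserves Sum
    # exactly when w is the same size as the argument it replaces.
    func = all(same(x, w) == Sum(w, y, z) and
               same(y, w) == Sum(x, w, z) and
               same(z, w) == Sum(x, y, w)
               for (x, y, z) in triples for w in sets)
    # DISJ+: disjoint sets sum to their union.
    disj = all(Sum(x, y, x | y) for x in sets for y in sets if not (x & y))
    # MONOT: a summand is never larger than the sum.
    monot = all(less(x, z) or same(x, z) for (x, y, z) in triples)
    return func, disj, monot
```

For a two-element basis, a set of size 2 and a set of size 1 "sum" to the empty set under the cyclic reading, which is exactly the kind of triple MONOT excludes.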
A model of CORE must satisfy an additional principle, DISJ$^{\cup}$, if Sum is to 
be interpreted in a way compatible with ADDITION: 

(DISJ$^{\cup}$) $(x \sim x' \wedge y \sim y' \wedge x \cap y = \emptyset \wedge x' \cap y' = \emptyset) \rightarrow (x \cup y) \sim (x' \cup y')$ 



(Note: The proofs in this chapter will use boolean principles freely, despite the 
fact that we have not yet introduced them.) 

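Theorem 2.2.2. CORE + ADDITION $\vdash$ DISJ$^{\cup}$ 

Proof. 

Suppose $(x \sim x' \wedge y \sim y' \wedge x \cap y = \emptyset \wedge x' \cap y' = \emptyset)$; 
then $\mathrm{Sum}(x, y, x \cup y)$ 
and $\mathrm{Sum}(x', y', x' \cup y')$, by DISJ+. 
So $\mathrm{Sum}(x', y, x \cup y)$, by FUNC+ (a). 
So $\mathrm{Sum}(x', y', x \cup y)$, by FUNC+ (b). 
So $x \cup y \sim x' \cup y'$, by FUNC+ (c). 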

□ 

If the minimal conditions on addition are to be satisfied in a model of CORE, 
then Sum has to be definable by DEF+. 

Theorem 2.2.3. CORE + ADDITION $\vdash$ DEF+ 

Proof. ($\Rightarrow$) 

Suppose $\mathrm{Sum}(x, y, z)$. 
So $x < z \vee x \sim z$, by MONOT. 
But if $x \sim z$, 
let $x' = z$ and $y' = \emptyset$; 
then $\mathrm{Sum}(x', y', z)$, by DISJ+. 
So $\mathrm{Sum}(x, y', z)$, by FUNC+ (a). 
So $y' \sim y$, by FUNC+ (b). 
And if $x < z$, 
pick $x' \subset z$ with $x' \sim x$, by REP<. 
Let $y' = z - x'$. 
But $y' \sim y$, as before. 

($\Leftarrow$) 

Suppose $(x \sim x' \wedge y \sim y' \wedge x' \cap y' = \emptyset \wedge x' \cup y' = z)$; 
then $\mathrm{Sum}(x', y', z)$, by DISJ+. 
So $\mathrm{Sum}(x, y', z)$, by FUNC+ (a). 
So $\mathrm{Sum}(x, y, z)$, by FUNC+ (b). 



□ 



Theory 2.2.4. EXCORE, the extended core, is CORE + ADDITION. 

Fact 2.2.5. The following are consequences of EXCORE: 

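$x \cap y_1 = x \cap y_2 = \emptyset \wedge y_1 \sim y_2 \rightarrow x \cup y_1 \sim x \cup y_2$ 

$x \cap y_1 = x \cap y_2 = \emptyset \wedge y_1 < y_2 \rightarrow x \cup y_1 < x \cup y_2$ 

$x_1 \cap y_1 = x_2 \cap y_2 = \emptyset \wedge x_1 \sim x_2 \wedge y_1 < y_2 \rightarrow x_1 \cup y_1 < x_2 \cup y_2$ 

$x \subset z \wedge y \subset z \wedge x < y \rightarrow (z - y) < (z - x)$ 

$x \subset z \wedge y \subset z \wedge x \sim y \rightarrow (z - y) \sim (z - x)$ 

$x < y \rightarrow \exists y' (y' \sim y \wedge x \subset y')$ 

$x \sim y \rightarrow x - (x \cap y) \sim y - (x \cap y)$ 

$x < y \rightarrow x - (x \cap y) < y - (x \cap y)$ 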

Proof. The proofs are elementary. □ 
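As an illustration only, the eight consequences listed in Fact 2.2.5 can be confirmed by exhaustive search in a finite class model, where sizes are cardinalities. This is a sketch under that assumption; the function names are hypothetical and the check covers only the small bases it is run on, so it is evidence rather than a proof.

```python
from itertools import combinations, product

def powerset(basis):
    """Return every subset of `basis` as a frozenset."""
    items = sorted(basis)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def lt(a, b):      # a < b: strictly fewer elements
    return len(a) < len(b)

def same(a, b):    # a ~ b: equally many elements
    return len(a) == len(b)

def check_fact_2_2_5(basis):
    """Brute-force the consequences of Fact 2.2.5, in the order listed."""
    S = powerset(basis)
    pairs = list(product(S, repeat=2))
    triples = list(product(S, repeat=3))
    c1 = all(same(x | y1, x | y2) for x, y1, y2 in triples
             if not (x & y1) and not (x & y2) and same(y1, y2))
    c2 = all(lt(x | y1, x | y2) for x, y1, y2 in triples
             if not (x & y1) and not (x & y2) and lt(y1, y2))
    c3 = all(lt(x1 | y1, x2 | y2)
             for x1, x2, y1, y2 in product(S, repeat=4)
             if not (x1 & y1) and not (x2 & y2)
             and same(x1, x2) and lt(y1, y2))
    # For frozensets, `x < z` is the proper-subset test.
    c4 = all(lt(z - y, z - x) for x, y, z in triples
             if x < z and y < z and lt(x, y))
    c5 = all(same(z - y, z - x) for x, y, z in triples
             if x < z and y < z and same(x, y))
    c6 = all(any(same(y2, y) and x < y2 for y2 in S)
             for x, y in pairs if lt(x, y))
    c7 = all(same(x - (x & y), y - (x & y)) for x, y in pairs if same(x, y))
    c8 = all(lt(x - (x & y), y - (x & y)) for x, y in pairs if lt(x, y))
    return [c1, c2, c3, c4, c5, c6, c7, c8]
```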
A Formal Theory of Class Size 

In the preceding chapter, we searched for principles that accord with 
pre-Cantorian ideas about sizes of sets. We produced several such principles, 
constituting EXCORE, and found two kinds of interpretations which satisfied these 
principles. In one case, the domains of the interpretations were finite power 
sets. In the other case, the domains consisted of all finite subsets of a given 
infinite set. Insofar as both kinds of interpretation have quantifiers ranging over 
finite sets, we may say that they demonstrate that EXCORE is true when it is 
construed as being about finite sets. 

Our goal is to show that this general theory of set size can be maintained for 
infinite sets as well as for finite sets. We shall show this by constructing a model 
for the general theory whose domain is the power set of the natural numbers. 
But the ability to construct such a model is interesting only to the extent that 
the general theory it satisfies is reasonably adequate. Suppose, for example, that 
we offered as a general theory of size the axioms of CORE other than REP<. 
Call this theory T. So T just says that smaller than is a quasi-linear ordering 
which extends the partial ordering given by the proper subset relation. Since 
any partial ordering can be extended to a quasi-linear ordering, T has a model 
over $\mathcal{P}(N)$. But unless we have a guarantee that the model constructed will 
satisfy, say, DISJ$^{\cup}$, the existence of the model does not rule out the possibility 
that T is incompatible with DISJ$^{\cup}$. For a particular principle, $\varphi$, in this case 
DISJ$^{\cup}$, we may take one of three tacks: (1) add $\varphi$ to T, obtaining T', and show 
that T' has a model over $\mathcal{P}(N)$; (2) show that $\varphi$ is inconsistent with T and argue 
that T is somehow more fundamental or more intuitive than $\varphi$; (3) acquiesce 
in ignorance of whether T and $\varphi$ are compatible and argue that if they are 
incompatible, then T should be maintained anyway. 

Below, we deal with DISJ$^{\cup}$ as in (1), since DISJ$^{\cup}$ is in EXCORE. Cantor's 
principle ONE-ONE is dealt with as in case (2). It seems futile to try to rule 
out the need to resort to the third approach for any cases at all, but we can 
reduce this need to the extent that we include in our general theory, T, as many 
plausible statements as possible. 

We cannot construct T by taking all statements which are true for finite 
sets. Not only is ONE-ONE such a statement, but using the notion of all 
statements true for finite sets presupposes that we have some idea of the range 
of all statements. To avoid the problems involved in speaking of all statements, 
we might instead settle for all statements in L, where L is some judiciously 
chosen language. To avoid ONE-ONE, T must fall short of the full expressive 
power of the language of set theory. 

Consider, now, the axioms in EXCORE. Other than size relations, these 
axioms involve only boolean operations and inclusion relations among sets. They 
do not use the notions ordered pair, relation, or function. In short, the only set 
theory implicit in these axioms is boolean algebra, or a sort of Venn diagram 
set theory. This is not to say the axioms do not apply to relations, functions, 
or other sets of ordered pairs, but only that they do not refer to these sorts of 
objects as such. 

In the next section, we define a language just strong enough to express 
EXCORE. We then construct a theory by taking all statements in that language 
which are true over all finite power sets. By drawing statements only from this 
relatively weak language, we arrive at a theory which can be satisfied over 
infinite power sets. But, since we include in the theory all statements of the 
language which are true over any finite power set, we know that no statement in 
that language can arise as something which ought to be true over infinite power 
sets but might be incompatible with our theory. 

There remains the possibility that we could follow the same strategy with a 
more expressive language, though it would have to remain less expressive than 
the full language of set theory. In fact, such a language can be obtained by 
including a notion of the product of set sizes. This in turn opens the possibility 
of a succession of richer languages and a corresponding succession of stronger 
theories of size. At this point, the possible existence of any such hierarchy is 
sheer speculation; we mention it only to emphasize that no claim is made here 
that we have the strongest possible general theory of set size. 

3.1 CS - The Theory of Class Size 

The theories discussed in this paper will be formulated within first order 
predicate logic with identity. To specify the language in which a theory is expressed, 
then, we need only list the individual constants, predicates, and operation 
symbols of the language and stipulate the rank, or number of argument places, for 
each predicate and each operation symbol. 

Definition 3.1.1. 

a. $L_C$, the language of classes, is the first order language with 
individual constants $\emptyset$ and $I$, the one-place predicate Atom, the two-place 
predicate $\subset$, and the two-place operation symbols $-$, $\cap$, and $\cup$. 

b. $L_<$, the language of size, is the first order language with the 
one-place predicate Unit, the two-place predicates $<$ and $\sim$, and the three-place 
predicate Sum. 

c. $L_{C<}$, the language of class size, is the first order language containing 
just the non-logical constants in $L_C$ and $L_<$. 

Following the strategy outlined above, we define the theory of class size in 
terms of interpretations of $L_{C<}$ over finite power sets. 

Definition 3.1.2. 

a. If $L_C \subseteq L$, then A is a standard interpretation of L iff 

(i) $A = \mathcal{P}(x)$, for some x, and 

(ii) A assigns the usual interpretations to all constants of $L_C$: 

$A(I) = x$, 
$A(\emptyset) = \emptyset$, 
$A \models a \subset b$ iff $a \subset b$ 

b. If A is a standard interpretation and $A = \mathcal{P}(x)$, then x is the basis of A, 
$B(A)$. 

Definition 3.1.3. A is a standard finite interpretation of $L_{C<}$ iff 

a. A is a standard interpretation of $L_{C<}$, 

b. A has a finite basis, and 

c. 
$A \models a < b$ iff $|a| < |b|$, 
$A \models a \sim b$ iff $|a| = |b|$, and 
$A \models \mathrm{Unit}(a)$ iff $|a| = 1$ 

Definition 3.1.4. CS, the theory of class size, is the set of all sentences 
of $L_{C<}$ which are true in all standard finite interpretations of $L_{C<}$. 
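A standard finite interpretation is small enough to build explicitly. The sketch below is illustrative only: the class and method names are hypothetical, and reading Sum as cardinal addition is an assumption taken from the intended reading given in Section 2.2, since Definition 3.1.3 as stated lists only <, ~ and Unit. The method `sum_def` evaluates Sum through DEF+ instead, so the two readings can be compared.

```python
from itertools import combinations

class StandardFiniteInterpretation:
    """Power set of a finite basis, with sizes read off as cardinalities."""

    def __init__(self, basis):
        self.basis = frozenset(basis)
        items = sorted(self.basis)
        self.domain = [frozenset(c) for r in range(len(items) + 1)
                       for c in combinations(items, r)]

    def less(self, a, b):          # a < b
        return len(a) < len(b)

    def same(self, a, b):          # a ~ b
        return len(a) == len(b)

    def unit(self, a):             # Unit(a)
        return len(a) == 1

    def sum_card(self, a, b, c):
        # Assumed reading: the size of c is the sum of the sizes of a and b.
        return len(c) == len(a) + len(b)

    def sum_def(self, a, b, c):
        # DEF+: c is the union of disjoint x', y' with x' ~ a and y' ~ b.
        # Once x' is a subset of c, y' is forced to be c - x'.
        return any(self.same(a, x2) and self.same(b, c - x2)
                   for x2 in self.domain if x2 <= c)
```

On every basis small enough to enumerate, the two readings of Sum should agree on all triples, which is what the equivalence DEF+ asserts.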

By drawing only on principles which can be stated in $L_{C<}$ we have at least 
ruled out the most obvious danger of paradox. Since the notion of one-to-one 
correspondence cannot be expressed in this language, Cantor's principle 
ONE-ONE will not be included in the theory CS, even though it is true over 
any finite power set. 

Since CS has arbitrarily large finite models, it has infinite models. It is not 
obvious that CS has standard infinite models, in which the universe is an infinite 
power set. In Chapter 6 we show that such models do exist. 

The present chapter is devoted to getting a clearer picture of the theory CS. 
Section 2 develops a set of axioms, CA, for CS. Section 3 outlines the proof 
that CA does indeed axiomatize CS. This proof is presented in Chapter 5, after 
a slight detour in Chapter 4. 
3.2 CA - Axioms for CS 

Here we shall develop a set of axioms, CA, for the theory CS. This will be done 
in several stages. 

3.2.1 BA- Axioms for atomic boolean algebra 

We'll begin with the obvious. Since all of the universes of the interpretations 
mentioned in the definition of CS are power sets, they must be atomic boolean 
algebras and, so, must satisfy BA: 

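Definition 3.2.1. BA, the theory of atomic boolean algebras, is the theory 
consisting of the following axioms: 

$x \cup y = y \cup x$ 
$x \cap y = y \cap x$ 
$x \cup (y \cup z) = (x \cup y) \cup z$ 
$x \cap (y \cap z) = (x \cap y) \cap z$ 
$x \cap (y \cup z) = (x \cap y) \cup (x \cap z)$ 
$x \cup (y \cap z) = (x \cup y) \cap (x \cup z)$ 
$x \cap (I - x) = \emptyset$ 
$x \cup (I - x) = I$ 
$x \subset y \leftrightarrow (x \cup y) = y \wedge x \neq y$ 
$\mathrm{Atom}(x) \leftrightarrow (\forall y)(y \subset x \rightarrow y = \emptyset)$ 
$x \neq \emptyset \rightarrow (\exists y)(\mathrm{Atom}(y) \wedge (y \subset x \vee y = x))$ 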


These axioms are adapted from [Monk, Def 9.3, p. 141 and Def. 9.28, p. 
151]. 

BA is clearly not a complete axiomatization of CS, since BA does not 
involve any size notions. But BA does entail all the sentences in CS which do 
not themselves involve size notions. To show this we need to draw on some 
established facts about the complete extensions of BA (in the language $L_C$). 
The key idea here is that complete extensions of BA can be obtained either by 
stipulating the finite number of atoms in a model or by saying that there are 
infinitely many atoms. 

Definition 3.2.2. For $n \geq 1$, 

a. ATLEAST$_n$ is a sentence which says that there are at least n atoms: 

$\exists x_1 \ldots \exists x_n (\mathrm{Atom}(x_1) \wedge \ldots \wedge \mathrm{Atom}(x_n) \wedge \bigwedge \{ x_i \neq x_j \mid 1 \leq i < j \leq n \})$ 

b. EXACTLY$_n$ is a sentence which says that there are exactly n atoms: 

ATLEAST$_n$ $\wedge$ $\neg$ATLEAST$_{n+1}$ 

c. INF is a set of sentences which is satisfied in all and only infinite models 
of BA: 

INF = $\{ \text{ATLEAST}_n \mid n \geq 1 \}$ 

d. BA$_n$ = BA; EXACTLY$_n$ 

e. BAI = BA $\cup$ INF 

Fact 3.2.3. 

a. For $n \geq 1$, BA$_n$ is categorical. [Monk, Cor 9.32, p. 152] 

b. For $n \geq 1$, BA$_n$ is complete. (Immediate from above) 

c. For $n \geq 1$, an n-atom atomic boolean algebra is isomorphic to any standard 
finite interpretation of $L_C$ with an n-element basis. [Monk, Prop. 9.30, 
p. 151] 

d. BAI is complete. [Monk, Theorem 21.24, p. 360] 

e. BAI and the theories BA$_n$, $n \geq 1$, are the only complete, consistent 
extensions of BA. 



Fact 3.2.4. If BAI $\vdash \varphi$, then $\varphi$ is true in some finite model of BA. 
Proof. BA $\cup$ INF $\vdash \varphi$. By compactness, then, there is a k such that 

BA $\cup \{ \text{ATLEAST}_n \mid 1 \leq n \leq k \} \vdash \varphi$ 

So $\varphi$ is true in any atomic boolean algebra with more than k atoms. □ 

Theorem 3.2.5. If CS $\vdash \varphi$, and $\varphi \in L_C$, then BA $\vdash \varphi$. 

Proof. If BA $\nvdash \varphi$, then $\neg\varphi$ is true in some atomic boolean algebra, A. 

If A is finite, then A is isomorphic to some standard finite interpretation A' 
of $L_C$. But then $\neg\varphi$ is true in A', so $\varphi$ is not true in A' and $\varphi \notin$ CS. 

If A is infinite, then $\neg\varphi$ is consistent with BAI. But BAI is complete, so 
BAI $\vdash \neg\varphi$. By 3.2.4, $\neg\varphi$ is true in some finite model A of BA. Hence, $\neg\varphi$ is 
true in some standard finite interpretation of $L_C$ and, again, $\varphi \notin$ CS. □ 



3.2.2 Size principles 

Here, we just gather the principles presented above as EXCORE: 



Theory 3.2.6. CORE SIZE consists of the following axioms: 

(SUBSET) $x \subset y \rightarrow x < y$ 

(DEF>) $x > y \leftrightarrow y < x$ 

(DEF~) $x \sim y \leftrightarrow \mathrm{Indisc}(x, y)$ 

(IRREF<) $\neg(x < x)$ 

(TRICH) $x < y \vee x \sim y \vee x > y$ 

(DEF1) $\mathrm{Unit}(x) \leftrightarrow \mathrm{Atom}(x)$ 

(DEF+) $\mathrm{Sum}(x, y, z) \leftrightarrow \exists x' \exists y' (x \sim x' \wedge y \sim y' \wedge x' \cap y' = \emptyset \wedge x' \cup y' = z)$ 

(DISJ$^{\cup}$) $(x \sim x' \wedge y \sim y' \wedge x \cap y = \emptyset \wedge x' \cap y' = \emptyset) \rightarrow x \cup y \sim x' \cup y'$ 

Combining the principles of boolean algebra and the size principles, we 
obtain our first serious attempt at a general theory of size: 

Definition 3.2.7. BASIC, the basic theory, is defined as: 

BASIC = BA $\cup$ SIZE 

3.2.3 Division principles 

BASIC is not a complete axiomatization of CS. In this section we shall exhibit 
an infinite number of principles which need to be added to BASIC in order to 
axiomatize CS. When we are done, we will have an effective set of sentences, 
CA, the Class Size Axioms, though we will not prove that CA $\equiv$ CS until 
Chapter 5. 

To show that the new principles really do need to be added, we'll need some 
non-standard models of BASIC. These models will be similar in that (1) their 
universes will be subsets of $\mathcal{P}(N)$, (2) their atoms will be singletons in $\mathcal{P}(N)$, 
and (3) all boolean symbols will receive their usual interpretations. The models 
will, however, include different subsets of N and assign different size orderings 
to these sets. 

In Chapter 6, these models of BASIC will reappear as submodels of various 
standard models of CS over $\mathcal{P}(N)$. So, in addition to the immediate purpose of 
establishing independence results, these models provide a glimpse of how sets 
of natural numbers are ordered by size. 

Every standard finite interpretation, A, of $L_{C<}$ satisfies exactly one of the 
following: 

(EVEN) $\exists x \exists y (x \sim y \wedge x \cap y = \emptyset \wedge x \cup y = I)$ 

(ODD) $\exists x \exists y \exists z (x \sim y \wedge x \cap y = x \cap z = y \cap z = \emptyset \wedge \mathrm{Atom}(z) \wedge x \cup y \cup z = I)$ 

$A \models$ EVEN if $|B(A)|$ is even and $A \models$ ODD if $|B(A)|$ is odd. But BASIC $\nvdash$ 
(EVEN $\vee$ ODD). Consider the model T whose universe consists of all and only the 
finite and cofinite subsets of N, where, for $a, b \in T$: 

$T \models (a < b)$ iff a and b are both finite and $|a| < |b|$, 
or a and b are both cofinite and $|N - a| > |N - b|$, 
or a is finite and b is infinite. 

T is a model of BASIC. But neither EVEN nor ODD is true in T, for any 
two sets that are the same size are either both finite, in which case their union 
is also finite, or both cofinite, in which case they cannot be disjoint. 
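The finite/cofinite model just described can be represented directly, which makes the failure of EVEN and ODD easy to observe on a sample. This is a minimal sketch with hypothetical names: a set is stored as a flag plus a finite set (`rest`), which is the set itself when finite and its complement in N when cofinite. Searching a finite sample cannot prove that no witnesses exist; it only illustrates the argument above.

```python
from dataclasses import dataclass
from itertools import combinations

@dataclass(frozen=True)
class FC:
    """A finite or cofinite subset of N."""
    cofinite: bool
    rest: frozenset   # the set itself if finite, its complement if cofinite

N = FC(True, frozenset())

def less(a, b):
    if not a.cofinite and not b.cofinite:
        return len(a.rest) < len(b.rest)
    if a.cofinite and b.cofinite:
        return len(a.rest) > len(b.rest)   # larger complement, smaller set
    return not a.cofinite                  # finite < cofinite, never the reverse

def same(a, b):
    return not less(a, b) and not less(b, a)

def disjoint(a, b):
    if a.cofinite and b.cofinite:
        return False                       # two cofinite sets always overlap
    if not a.cofinite and not b.cofinite:
        return not (a.rest & b.rest)
    fin, cof = (b, a) if a.cofinite else (a, b)
    return fin.rest <= cof.rest            # fin lies inside what cof omits

def union(a, b):
    if not a.cofinite and not b.cofinite:
        return FC(False, a.rest | b.rest)
    if a.cofinite and b.cofinite:
        return FC(True, a.rest & b.rest)
    fin, cof = (b, a) if a.cofinite else (a, b)
    return FC(True, cof.rest - fin.rest)

def sample(k):
    """Finite subsets of {0..k-1} together with their complements in N."""
    subsets = [frozenset(c) for r in range(k + 1)
               for c in combinations(range(k), r)]
    return [FC(flag, s) for flag in (False, True) for s in subsets]

def even_witnesses(elements):
    """Pairs that would make EVEN true: same size, disjoint, union N."""
    return [(a, b) for a in elements for b in elements
            if same(a, b) and disjoint(a, b) and union(a, b) == N]
```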

So EVEN $\vee$ ODD is in CS, but not entailed by BASIC. As you might suspect, 
this is just the tip of the iceberg of principles missing from an axiomatization of 
CS. Informally, we can extend T to a model that satisfies EVEN by including 
the set of even numbers and the set of odd numbers and making them the same 
size. To round out the result to a model of BASIC, we also need to include all 
sets which are near the set of evens or the set of odds, i.e. those that differ 
from the evens or odds by a finite set. With these additions made, the new 
model will be closed under boolean operations and will satisfy BA. There is 
a (unique) way of ordering these added sets by smaller than that will satisfy 
BASIC: rank them according to the size and direction of their finite difference 
from the odds or the evens. So, we can construct an infinite model of BASIC; 
(EVEN $\vee$ ODD). 

But this model will still not satisfy CS, as we can see by generalizing the 
argument above. (EVEN $\vee$ ODD) says that the universe is roughly divisible by 
two: EVEN says that the universe is divisible by two without remainder. ODD 
says that there is a remainder of a single atom. We can construct a similar 
statement that says the universe is roughly divisible by three, with remainder 
0, 1, or 2. As with (EVEN $\vee$ ODD), this statement is in CS but is satisfied by 
neither our original model nor the model as amended. Again, we can extend 
the model and again we can produce a statement of CS which is false in the 
resulting model. 

We now formalize this line of reasoning. 



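Definition 3.2.8. If $0 \leq m < n$, then 

a. MOD$_{n,m}$ is the sentence 

\[
\begin{aligned}
\exists x_1 \ldots \exists x_n \exists y_1 \ldots \exists y_m (\; & x_1 \sim x_2 \sim \cdots \sim x_n \\
& \wedge\ \mathrm{Atom}(y_1) \wedge \ldots \wedge \mathrm{Atom}(y_m) \\
& \wedge \bigwedge_{1 \leq i < j \leq n} (x_i \cap x_j = \emptyset) \\
& \wedge \bigwedge_{1 \leq i < j \leq m} (y_i \cap y_j = \emptyset) \\
& \wedge\ (x_1 \cup \ldots \cup x_n) \cap (y_1 \cup \ldots \cup y_m) = \emptyset \\
& \wedge\ (x_1 \cup \ldots \cup x_n) \cup (y_1 \cup \ldots \cup y_m) = I \;)
\end{aligned}
\]

MOD$_{n,m}$ says that the universe is divisible into n sets of the same size 
with m atoms remaining. 

b. DIV$_n$ is the sentence 

MOD$_{n,0}$ $\vee \ldots \vee$ MOD$_{n,n-1}$ 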

Fact 3.2.9. If $0 \leq m < n$ and A is a standard finite interpretation of $L_{C<}$, 
then 

(a) $A \models$ MOD$_{n,m}$ iff $|B(A)| \equiv m \bmod n$ 

(b) $A \models$ DIV$_n$ 

(c) CS $\vdash$ DIV$_n$ 
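Fact 3.2.9(a) can be checked by brute force for small bases. The sketch below is an illustration, not the author's method: `satisfies_mod` is a hypothetical helper that labels each basis element with one of n blocks or as a leftover atom, and asks whether some labelling gives n blocks of equal cardinality and exactly m leftovers.

```python
from itertools import product

def satisfies_mod(size, n, m):
    """Does a `size`-element basis split into n disjoint blocks of equal
    cardinality plus exactly m leftover atoms? (Requires n >= 1.)"""
    for labels in product(range(n + 1), repeat=size):   # label n = leftover
        counts = [labels.count(i) for i in range(n + 1)]
        if counts[n] == m and len(set(counts[:n])) == 1:
            return True
    return False

def satisfies_div(size, n):
    """DIV_n: some MOD_{n,m} with 0 <= m < n holds."""
    return any(satisfies_mod(size, n, m) for m in range(n))
```

Blocks may be empty, which matches the sentence MOD$_{n,m}$: for a one-element basis, MOD$_{2,1}$ holds with two empty blocks and one leftover atom.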

Theorem 3.2.10. BASIC $\nvdash$ CS 

Proof. If $n > 1$, BASIC $\nvdash$ DIV$_n$. The model T defined above satisfies BASIC 
but not DIV$_n$, for any n finite sets have a finite union and any two cofinite sets 
overlap. So, BASIC $\nvdash$ CS, by 3.2.9c. □ 

We could consider adding all DIV$_n$ sentences to BASIC in the hope that this 
would yield a complete set of axioms for CS. We did consider this, but it does 
not work. To demonstrate this, we need some independence results for sets of 
DIV$_n$ sentences. 

Definition 3.2.11. If J is a set of natural numbers, then 

a. DIV$_J$ = $\{ \text{DIV}_n \mid n \in J \}$ 

b. BDIV$_J$ = BASIC $\cup$ DIV$_J$ 

c. BDIV$_j$ = BDIV$_{\{j\}}$ 

Our independence results will be obtained by constructing models of BASIC 
which satisfy specific sets of DIV sentences. To build such models from subsets 
of N, we shall include sets which can be regarded as fractional portions of N. 

Definition 3.2.12. For $n > 0$, 

a. x is an n-congruence class iff $x = [nk + m]$ for some m, where 
$0 \leq m < n$. 

b. x is an n-quasi-congruence class iff x is the union of finitely many 
n-congruence classes. 

c. x is a congruence class iff x is an n-congruence class for some n. 

d. x is a quasi-congruence class iff x is an n-quasi-congruence class for 
some n. 

e. $QC_n = \{ x \mid x \text{ is an } n\text{-quasi-congruence class} \}$ 

f. $QC = \bigcup_{n > 0} QC_n$ 

Examples. 

a. The set of evens, $[2n]$, and the set of odds, $[2n + 1]$, are 2-congruence 
classes. 

b. $[3k + 2]$ is a 3-congruence class. 

c. N is a 1-congruence class. 

d. N is an n-quasi-congruence class for every $n > 0$: 

$N = [nk + 0] \cup \ldots \cup [nk + n - 1]$ 

Fact 3.2.13. 

a. If $x \in QC_n$ and $y \in QC_n$, then $x \cup y \in QC_n$. 

b. If $x \in QC_n$, then $N - x \in QC_n$. 
Proof. 

a. Suppose $x = a_1 \cup \ldots \cup a_k$ and $y = b_1 \cup \ldots \cup b_j$. Then 
$x \cup y = a_1 \cup \ldots \cup a_k \cup b_1 \cup \ldots \cup b_j$. 

b. N itself is the union of n-congruence classes. If x is the union of m of these 
classes, then $N - x$ is the union of the remaining $(n - m)$ n-congruence 
classes. 

□ 
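Because an n-quasi-congruence class is determined by the set of residues mod n that it contains, the closure properties in Fact 3.2.13 reduce to finite set operations on residues. A minimal sketch, assuming that representation (the class name `QuasiCongruence` and its methods are hypothetical):

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class QuasiCongruence:
    """Union of the n-congruence classes [n*k + r] for r in `residues`."""
    n: int
    residues: frozenset

    def __contains__(self, k):
        return k % self.n in self.residues

    def union(self, other):
        # Fact 3.2.13a: both operands must be n-quasi-congruence classes
        # for the same n.
        assert self.n == other.n
        return QuasiCongruence(self.n, self.residues | other.residues)

    def complement(self):
        # Fact 3.2.13b: N - x is the union of the remaining classes.
        return QuasiCongruence(self.n,
                               frozenset(range(self.n)) - self.residues)
```

For example, `QuasiCongruence(2, frozenset({0}))` represents the evens, and its union with the odds is the representation of N as a 2-quasi-congruence class.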

Definition 3.2.14. x is near y, NEAR(x, y), iff $x - y$ and $y - x$ are both finite. 

Fact 3.2.15. x is near y iff there are finite sets, $w_1$ and $w_2$, such that 
$x = (y \cup w_1) - w_2$. 

Proof. 

($\Rightarrow$) Let $w_1 = x - y$ and let $w_2 = y - x$. 

($\Leftarrow$) Suppose $x = (y \cup w_1) - w_2$. Then $x - y \subseteq w_1$ and $y - x \subseteq w_2$, so $x - y$ 
and $y - x$ are finite. 

□ 

Fact 3.2.16. If $x_1 \subset x \subset x_2$, $x_1$ is near y, and $x_2$ is near y, then x is near y. 

Proof. Since $x \subset x_2$, $(x - y) \subseteq (x_2 - y)$. But $(x_2 - y)$ is finite, so $(x - y)$ is 
finite. 

Since $x_1 \subset x$, $(y - x) \subseteq (y - x_1)$. But $(y - x_1)$ is finite, so $(y - x)$ is finite. □ 

Fact 3.2.17. NEAR is an equivalence relation. 
Proof. 

a. x is near x, since x — x = 0, which is finite. 

b. If x is near y, then y is near x. Immediate. 

c. Suppose x is near y and y is near z. Note that 

$(z - x) = ((z \cap y) - x) \cup ((z - y) - x)$ 

But $(z - y) - x$ is finite because $(z - y)$ is finite, and $((z \cap y) - x)$ is finite 
because $((z \cap y) - x) \subseteq (y - x)$, which is finite. So the union, $(z - x)$, is 
finite. 

Similarly, 

$(x - z) = ((x \cap y) - z) \cup ((x - y) - z)$ 

where $((x \cap y) - z) \subseteq (y - z)$ and $((x - y) - z) \subseteq (x - y)$. So $(x - z)$ is also 
finite. 

Hence, x is near z. 

□ 

Fact 3.2.18. 

a. If $x_1$ is near $x_2$, then $x_1 \cup y$ is near $x_2 \cup y$. 

b. If $x_1$ is near $x_2$ and $y_1$ is near $y_2$, then $x_1 \cup y_1$ is near $x_2 \cup y_2$. 

c. If x is near y, then $N - x$ is near $N - y$. 

Proof. 

a. Since $x_1$ is near $x_2$, $x_1 - x_2$ and $x_2 - x_1$ are finite. But 

$(x_1 \cup y) - (x_2 \cup y) \subseteq (x_1 - x_2)$ 
and $(x_2 \cup y) - (x_1 \cup y) \subseteq (x_2 - x_1)$ 

b. By (a), $x_1 \cup y_1$ is near $x_2 \cup y_1$, which is near $x_2 \cup y_2$. So $x_1 \cup y_1$ is near 
$x_2 \cup y_2$, by transitivity. 

c. $(N - x) - (N - y) = y - x$ and $(N - y) - (N - x) = x - y$. So if x and y 
are near each other, so are their complements. 

□ 

We can now define the domains of the models we will use to establish the 
independence results. 

Definition 3.2.19. 

a. $Q_n = \{ y \mid y \text{ is near an } n\text{-quasi-congruence class} \}$ 

b. $Q = \bigcup_{n > 0} Q_n$ 

Examples. 

a. $Q_1 = \{ y \subseteq N \mid y \text{ is finite or cofinite} \}$. 

b. $Q_2 = \{ y \subseteq N \mid y \text{ is finite, cofinite, near } [2n] \text{ or near } [2n + 1] \}$. 
$Q_2$ is the domain of the model constructed above to satisfy DIV$_2$. 

Fact 3.2.20. If A is a class of sets such that 

a. $\bigcup A \in A$, 

b. If $x \in A$, then $\bigcup A - x \in A$, and 

c. If $x \in A$ and $y \in A$, then $x \cup y \in A$, 

then A forms a boolean algebra under the usual set-theoretic operations, where 
I is interpreted as $\bigcup A$. [Monk, Def. 9.1, p. 141 and Cor. 9.4, p. 14?] 



Theorem 3.2.21. If $n > 0$, then $Q_n$ forms an atomic boolean algebra under 
the usual set-theoretic operations. 

Proof. Using Fact 3.2.20: 

a. $\bigcup Q_n = N$, since $N \in Q_n$ and if $x \in Q_n$, then $x \subseteq N$. 

b. If $x \in Q_n$, then $N - x \in Q_n$: 

Suppose x is near y 
and $y \in QC_n$. 
So $N - y \in QC_n$, by 3.2.13b, 
and $N - x$ is near $N - y$, by 3.2.18c. 
So $N - x \in Q_n$. 

c. 
Suppose $x \in Q_n$ 
and $y \in Q_n$; 
then x is near $x'$ 
and $x' \in QC_n$ for some $x'$, 
and y is near $y'$ 
and $y' \in QC_n$ for some $y'$. 
But then $x' \cup y' \in QC_n$, by 3.2.13a, 
and $x \cup y$ is near $x' \cup y'$, by 3.2.18b. 
So $x \cup y \in Q_n$. 

Thus, $Q_n$ is a boolean algebra. Moreover, every singleton is in $Q_n$ since all 
singletons are near $\emptyset$. So $Q_n$ is an atomic boolean algebra. 

□ 

We now define a size function on all sets which are near quasi-congruence 
classes. The sizes assigned to sets are ordered pairs. The first member is a 
rational between 0 and 1 which represents the density of the set. The second 
member is an integer which represents the finite (possibly negative) deviation 
of a set from average sets of the same density. First, we define the ordering and 
arithmetic for these sizes with the intention of inducing the size ordering and 
Sum relation for sets from the assignment of sizes to sets. 

Definition 3.2.22. 

a. A size is an ordered pair $(\rho, \delta)$, where $\rho$ is rational and $\delta$ is an integer. 

b. If $\theta_1 = (\rho_1, \delta_1)$ and $\theta_2 = (\rho_2, \delta_2)$ are sizes, then 

(i) $\theta_1 < \theta_2$ iff $\rho_1 < \rho_2$ or $(\rho_1 = \rho_2 \wedge \delta_1 < \delta_2)$. 

(ii) $\theta_1 + \theta_2 = (\rho_1 + \rho_2, \delta_1 + \delta_2)$ 

Only some of these sizes will actually be assigned to sets. Specifically, a size 
will be assigned to a set only if $0 \leq \rho \leq 1$. Moreover, if $\rho = 0$, then $\delta \geq 0$ and 
if $\rho = 1$, then $\delta \leq 0$. 
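The ordering and arithmetic of Definition 3.2.22 amount to a lexicographic order and componentwise addition on pairs. A minimal sketch, assuming exact rationals from Python's `fractions` module; the class name `Size` and the method `admissible` (which encodes the side conditions on sizes actually assigned to sets) are hypothetical:

```python
from dataclasses import dataclass
from fractions import Fraction

@dataclass(frozen=True)
class Size:
    rho: Fraction   # density component
    delta: int      # finite deviation component

    def __lt__(self, other):
        # (i): compare densities first, deviations only on a tie.
        return (self.rho, self.delta) < (other.rho, other.delta)

    def __add__(self, other):
        # (ii): componentwise addition.
        return Size(self.rho + other.rho, self.delta + other.delta)

    def admissible(self):
        """True iff this size can be assigned to a set."""
        if not 0 <= self.rho <= 1:
            return False
        if self.rho == 0:
            return self.delta >= 0
        if self.rho == 1:
            return self.delta <= 0
        return True
```

Note that density dominates: a set of density 1/2 with a large positive deviation is still smaller than any set of density 2/3.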
Our intention in assigning sizes to sets is as follows: Suppose x is near an 
n-quasi-congruence class, $x'$. So $x'$ is the union of $k \leq n$ n-congruence 
classes. The set $x'$ has density $k/n$ and this is the value, $\rho$, assigned to x. The 
$\delta$ value assigned to x is the finite number of elements added to or removed from 
$x'$ to obtain x. 

The definitions and facts below formalize this intention and demonstrate 
that the assignment of sizes to sets is well-defined. 

Fact 3.2.23. 

a. If $x \in QC$, $y \in QC$, and $x \neq y$, then x is not near y. (That is, no two 
quasi-congruence classes are near each other.) 

b. Any set is near at most one quasi-congruence class. 
Proof. 

a. Let n be the least number such that 

$x \in QC_n$ and $y \in QC_n$ 

So each is a disjoint union of n-congruence classes: 

$x = x_1 \cup \ldots \cup x_j$ 
and $y = y_1 \cup \ldots \cup y_k$ 

Suppose $a \in x - y$; 
then $a \in x_i$ for some i. 
But $a \notin y_j$, for any j. 
So $x_i \neq y_j$, for any j. 
So $x_i \subseteq x - y$. 
So $x - y$ is infinite, since $x_i$ is infinite. 

Similarly, if $a \in y - x$, then $y - x$ is infinite. 

b. If x were near two quasi-congruence classes, the two would have to be near 
each other, since NEAR is transitive. But this is impossible by (a). 


□ 



Definition 3.2.24. If x is near a quasi-congruence class, then 

a. $C(x)$ is the quasi-congruence class near x. 

b. $\Delta_1(x) = x - C(x)$. 

c. $\Delta_2(x) = C(x) - x$. 

$\Delta_1(x)$ and $\Delta_2(x)$ are finite and 

$x = (C(x) \cup \Delta_1(x)) - \Delta_2(x)$ 

Definition 3.2.25. If $x \in QC$, then 

a. $\alpha(x)$ = the least n such that $x \in QC_n$. 

b. $\beta(x)$ = the unique k such that x is the disjoint union of k $\alpha(x)$-congruence 
classes. 

Examples. 

a. If $x = [2n + 1]$, $\alpha(x) = 2$ and $\beta(x) = 1$. 

b. If $x = [4n + 1] \cup [4n + 2]$, $\alpha(x) = 4$ and $\beta(x) = 2$. 

c. If $x = [4n + 1] \cup [4n + 3]$, $\alpha(x) = 2$ and $\beta(x) = 1$, since $x = [2n + 1]$. 

Definition 3.2.26. If $x \in Q$, then 

$\rho(x) = \beta(C(x)) / \alpha(C(x))$ 
$\delta(x) = |\Delta_1(x)| - |\Delta_2(x)|$ 
$\theta(x) = (\rho(x), \delta(x))$ 

We can, at last, define the models to be used in our independence proof. 

Definition 3.2.27. For $n > 0$, $\mathcal{Q}_n$ is the interpretation $\mathcal{Q}$ of $L_{C<}$ in which 
boolean symbols receive their usual interpretation and 

the domain of $\mathcal{Q}$ is $Q_n$ 
$\mathcal{Q} \models x < y$ iff $\theta(x) < \theta(y)$ 
$\mathcal{Q} \models x \sim y$ iff $\theta(x) = \theta(y)$ 
$\mathcal{Q} \models \mathrm{Unit}(x)$ iff $\theta(x) = (0, 1)$ 
$\mathcal{Q} \models \mathrm{Sum}(x, y, z)$ iff $\theta(z) = \theta(x) + \theta(y)$ 

To show that the models $\mathcal{Q}_n$ satisfy BASIC, we will need the following facts 
about congruence classes. 

Fact 3.2.28. 

a. If $x = [an + b]$ and $a_2 = ac$, then 

$x = \bigcup_{0 \leq i < c} [a_2 n + (ia + b)]$ 

b. If $x \in QC_n$ and $m = kn$, then $x \in QC_m$. 

c. If $x \in QC$ and $y \in QC$, there is an n such that $x \in QC_n$ and $y \in QC_n$. 
Proof. 

a. If $k \in [a_2 n + (ia + b)]$ for some i, $0 \leq i < c$, then, for some $n_1$, 

$k = a_2 n_1 + ia + b$ 
$\;\; = a(c n_1) + ia + b$ 
$\;\; = a(c n_1 + i) + b$ 

So $k \in x$. 

If $k \in x$, there is an $n_1$ such that $k = a n_1 + b$. Let $n_2$ be the greatest n 
such that $a_2 n \leq k$ and let $k' = k - a_2 n_2$. 

Since $k \equiv b \bmod a$ 
and $a_2 n_2 \equiv 0 \bmod a$, 
then $k' \equiv b \bmod a$. 
So $k' = ia + b$, where $0 \leq i < c$. 
But $k = a_2 n_2 + k' = a_2 n_2 + ia + b$. 
So $k \in [a_2 n + (ia + b)]$. 

b. By (a), each n-congruence class is a disjoint union of m-congruence classes. 

c. Suppose $x \in QC_{n_1}$ and $y \in QC_{n_2}$. Then, by (b), both x and y are in 
$QC_{n_1 n_2}$. 
□ 
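The quantities $\alpha$, $\beta$ and $\theta$, and the refinement in Fact 3.2.28a, can be computed from a residue representation. The sketch below is illustrative and rests on assumptions not in the text: a set near a quasi-congruence class is given as a modulus n, a set of residues, and finite sets of added and removed elements; all function names are hypothetical. The least modulus is found among the divisors of n, since a set periodic mod n and mod n' is periodic mod gcd(n, n').

```python
from fractions import Fraction

def alpha_beta(n, residues):
    """(alpha, beta) for the union of the classes [n*k + r], r in residues."""
    residues = frozenset(r % n for r in residues)
    for d in range(1, n + 1):
        # d works iff d divides n and the residue set is unchanged by adding d.
        if n % d == 0 and all((r + d) % n in residues for r in residues):
            return d, len({r % d for r in residues})

def theta(n, residues, added=(), removed=()):
    """Size (rho, delta) of (C union added) - removed, C as in alpha_beta."""
    residues = frozenset(r % n for r in residues)
    added, removed = frozenset(added), frozenset(removed)
    assert all(k % n not in residues for k in added)    # added lies outside C
    assert all(k % n in residues for k in removed)      # removed lies inside C
    a, b = alpha_beta(n, residues)
    return Fraction(b, a), len(added) - len(removed)

def refine(a, b, c):
    """Residues mod a*c whose classes make up [a*n + b] (Fact 3.2.28a)."""
    return frozenset(i * a + b for i in range(c))
```

For instance, the evens with 1 added and 0 and 2 removed would be `theta(2, {0}, added={1}, removed={0, 2})`, a set of density 1/2 with deviation -1.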

Theorem 3.2.29. For any $n > 0$, $\mathcal{Q}_n \models$ BASIC 

Proof. By 3.2.21, $Q_n$ is an atomic boolean algebra; so $\mathcal{Q}_n \models$ BA. The <-relation 
of $\mathcal{Q}_n$ is induced from the linear ordering of sizes; so it is a quasi-linear ordering 
and IRREF<, TRICH, and DEF~ are satisfied. As for the remaining axioms: 

a. SUBSET: Suppose $x \subset y$. If $C(x) = C(y)$ then $\rho(x) = \rho(y)$ and $\Delta_1(x) \subseteq 
\Delta_1(y)$ and $\Delta_2(y) \subseteq \Delta_2(x)$, where at least one of these inclusions is proper. 
So $\delta(x) < \delta(y)$. 

But if $C(x) \neq C(y)$, then $C(x) \subset C(y)$, so $\rho(x) < \rho(y)$. 
In either case, $\theta(x) < \theta(y)$, so $\mathcal{Q}_n \models x < y$. 

b. REP<: Suppose $\mathcal{Q}_n \models x < y$. So $\theta(x) = (k_1/n, \delta_1)$, $\theta(y) = (k_2/n, \delta_2)$, and 
either $k_1 < k_2$, or $k_1 = k_2$ and $\delta_1 < \delta_2$. 

We want to find some $x' \subset y$ such that $\mathcal{Q}_n \models x \sim x'$. 

If $k_1 = k_2 > 0$, then y must be infinite; so, $x'$ can be obtained by removing 
$\delta_2 - \delta_1$ atoms from y. 

If $k_1 = k_2 = 0$, then $0 \leq \delta_1 < \delta_2$; so, again, $x'$ can be obtained by removing 
$\delta_2 - \delta_1$ atoms from y. 

If $k_2 > k_1 > 0$, then let $y_1$ be the union of $k_1$ n-congruence classes 
contained in $C(y)$. So $y - y_1$ is infinite and $y_1 - y$ is finite. Let $y_2 = 
y_1 - (y_1 - y) = y_1 \cap y$. So $\theta(y_2) = (k_1/n, -\delta_3)$ where $\delta_3 = |y_1 - y|$. Finally, 
let $\delta_4 = \delta_3 + \delta_1$ and $x' = y_2 \cup y_3$, where $y_3 \subseteq y - y_1$ with $|y_3| = \delta_4$. 

If $k_1 = 0$, then x is finite. If $k_2 > 0$, then y is infinite, so there is no 
problem. If $k_2 = 0$, then y is finite, but has more members than x, since 
$\delta_1 < \delta_2$. So, let $x'$ be any proper subset of y with $\delta_1$ members. 

c. DISJ∪: It is enough to show that if x and y are disjoint, then θ(x ∪ y) =
θ(x) + θ(y). We need the following three facts:

(i) C(x ∪ y) = C(x) ∪ C(y). (See Fact 3.2.18b.)

(ii) Δ_1(x ∪ y) = (Δ_1(x) ∪ Δ_1(y)) − (C(x) ∪ C(y)).

If a ∈ x ∪ y but a ∉ C(x ∪ y), then a ∈ Δ_1(x) or a ∈ Δ_1(y); any
element of Δ_1(x) is also in Δ_1(x ∪ y) unless it is in C(y); any element
of Δ_1(y) is also in Δ_1(x ∪ y) unless it is in C(x).

(iii) Δ_2(x ∪ y) = (Δ_2(x) ∪ Δ_2(y)) − (Δ_1(x) ∪ Δ_1(y)).

Note that if x ∈ Q, y ∈ Q, and x ∩ y = ∅, then C(x) ∩ C(y) = ∅;
otherwise, C(x) and C(y) have an infinite intersection.

Hence, if a ∈ (C(x) ∪ C(y)) − (x ∪ y)
then a ∈ C(x) − x = Δ_2(x)
or a ∈ C(y) − y = Δ_2(y)
And if a ∈ Δ_2(x), then a ∈ Δ_2(x ∪ y) unless a ∈ Δ_1(y)
And if a ∈ Δ_2(y), then a ∈ Δ_2(x ∪ y) unless a ∈ Δ_1(x)

From (ii) we obtain (iia):

(iia) |Δ_1(x ∪ y)| = |Δ_1(x) ∪ Δ_1(y)| − |(Δ_1(x) ∪ Δ_1(y)) ∩ (C(x) ∪ C(y))|

and from (iii) we obtain (iiia):

(iiia) |Δ_2(x ∪ y)| = |Δ_2(x) ∪ Δ_2(y)| − |(Δ_1(x) ∪ Δ_1(y)) ∩ (Δ_2(x) ∪ Δ_2(y))|

But Δ_1(x) and Δ_1(y) are disjoint, since x and y are disjoint. So:

(iv) |Δ_1(x) ∪ Δ_1(y)| = |Δ_1(x)| + |Δ_1(y)| = δ_1(x) + δ_1(y)

Since Δ_2(x) and Δ_2(y) are contained, respectively, in C(x) and C(y),
which are disjoint, they are also disjoint. So:

   |Δ_2(x) ∪ Δ_2(y)| = δ_2(x) + δ_2(y)

The subtracted terms in (iia) and (iiia) are equal: since x and y are disjoint,
an element of Δ_1(x) lies in C(y) just in case it lies in Δ_2(y), and likewise
with x and y exchanged. So they cancel, and

   δ_1(x ∪ y) − δ_2(x ∪ y)
      = (δ_1(x) + δ_1(y)) − (δ_2(x) + δ_2(y))
      = (δ_1(x) − δ_2(x)) + (δ_1(y) − δ_2(y))
      = δ(x) + δ(y)

Since ρ(x ∪ y) = ρ(x) + ρ(y), by (i), we know that θ(x ∪ y) = θ(x) + θ(y).

d. DEF+:

Suppose Q_n ⊨ Sum(x, y, z)
So θ(z) = θ(x) + θ(y)
Clearly θ(x) ≤ θ(z)

Assume θ(x) = θ(z)
then θ(y) = (0, 0)
So y = ∅
Let x' = z, y' = ∅ to satisfy DEF+

Assume θ(x) < θ(z)
then Q_n ⊨ x' ~ x
and x' ⊂ z for some x' since Q_n ⊨ REP<
Let y' = z − x'
So z = x' ∪ y'
Claim θ(y) = θ(y') so Q_n ⊨ y ~ y'
For θ(x') + θ(y') = θ(z) by DISJ∪
But θ(x') = θ(x)
So θ(y') = θ(z) − θ(x) = θ(y)

(Cancellation is valid for sizes because it is valid for rationals and integers.)

Conversely,

If θ(z) = θ(x') + θ(y')
and θ(x') = θ(x)
and θ(y') = θ(y)
then θ(z) = θ(x) + θ(y)

□

Theorem 3.2.30. For any n > 0, Q_n ⊨ DIV_m iff m | n.

Proof.

(⇒) θ(N) = (1, 0). Hence, if m disjoint sets of the same size exhaust N, they
must each have size (1/m, 0). But if x ∈ Q_n, then θ(x) = (a/n, b), for
integral a and b. So b = 0 and a = n/m.

(⇐) For each i, 0 ≤ i < n, let A_i = [nk + i]. So N = ⋃_{0≤i<n} A_i.

If i ≠ j, then A_i ∩ A_j = ∅ and Q_n ⊨ (A_i ~ A_j) since θ(A_i) = θ(A_j) =
(1/n, 0).

Letting p = n/m, group the n sets A_i into m collections with p members
in each:

   B_1, ..., B_m

Letting b_j = ⋃B_j for 1 ≤ j ≤ m, we have b_j ∈ Q_n and θ(b_j) = (p/n, 0) =
(1/m, 0).

Furthermore, b_1 ∪ ... ∪ b_m = N.

□ 
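
The grouping used in the (⇐) direction can be tried out on an initial segment of N. The following is only an illustrative sketch, not part of the proof: it assumes that membership in A_i is decided by x mod n, and the function names are hypothetical.

```python
def residue_blocks(n, m):
    """Group the residue classes 0..n-1 into m blocks of p = n // m classes each."""
    if m <= 0 or n % m != 0:
        raise ValueError("m must be a positive divisor of n")
    p = n // m
    return [list(range(j * p, (j + 1) * p)) for j in range(m)]


def block_index(x, n, m):
    """Index j of the block b_j that contains the natural number x."""
    return (x % n) // (n // m)


# Example: n = 6, m = 3 gives three blocks, each the union of two residue classes.
blocks = residue_blocks(6, 3)
counts = [0, 0, 0]
for x in range(600):  # an initial segment of N
    counts[block_index(x, 6, 3)] += 1
```

On any initial segment whose length is a multiple of n, the m blocks receive the same number of elements, which is the finite shadow of θ(b_j) = (1/m, 0).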

Definition 3.2.31. If J ≠ ∅ and J is finite, then the least common multiple
of J, μ(J), is the least k which is divisible by every member of J.

Remark. μ(J) always exists since the product of all members of J is divisible
by each member of J. Usually, the product is greater than μ(J).
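
A small numerical illustration of the remark, assuming J is given as a list of positive integers (the helper name `lcm_of` is ours, not the text's):

```python
from functools import reduce
from math import gcd, prod


def lcm_of(J):
    """Least common multiple of a non-empty finite collection of positive integers."""
    return reduce(lambda a, b: a * b // gcd(a, b), J)


J = [4, 6, 10]
least = lcm_of(J)      # least k divisible by every member of J
product_J = prod(J)    # always a common multiple, usually not the least one
```

Here the product is four times the least common multiple; the two coincide exactly when the members of J are pairwise relatively prime.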

Corollary 3.2.32. If J is finite, then

a. If BDIV_n ⊢ DIV_m, then m | n.

b. If BDIV_J ⊢ DIV_m, then m | μ(J).

c. There are only finitely many m for which BDIV_J ⊢ DIV_m.

Proof. Based on Theorem 3.2.30:

a. If m ∤ n, then Q_n ⊨ BDIV_n; ¬DIV_m.

b. Q_μ(J) ⊨ BDIV_J since it satisfies DIV_j for each j ∈ J. But, if m ∤ μ(J),
then Q_μ(J) ⊭ DIV_m.

c. Immediate from (b), since only finitely many m divide μ(J).

□

We are now ready to show that BDIV_N ⊬ CS by finding a sentence in CS
which entails infinitely many DIV_n sentences. Such sentences can be produced
by generalizing the notion of divisibility to all sets instead of applying it only
to the universe.

Definition 3.2.33.

a. If 0 < n, then Times_n(x, y) is the formula:

   ∃x_1 ... ∃x_n [x_1 = x ∧ x_n = y ∧ ⋀_{1<i≤n} Sum(x_{i−1}, x, x_i)]

Times_n(x, y) says that y is the same size as the Sum of n sets, each the
same size as x.

b. If 0 ≤ m < n, then Mod_{n,m}(z) is the formula:

   ∃x∃y∃v∃w [Times_n(x, v) ∧ Unit(y) ∧ Times_m(y, w) ∧ Sum(v, w, z)]

Mod_{n,m}(z) says that z can be partitioned into n sets of the same size and
m atoms.

c. Div_n(z) is the formula

   Mod_{n,0}(z) ∨ ... ∨ Mod_{n,n−1}(z)

d. ADIV_n is the sentence

   ∀x Div_n(x)

Remark. We have taken this opportunity to formulate the divisibility predicates
purely in terms of size predicates. Notice that in the presence of BASIC,

   MOD_{n,m} ≡ Mod_{n,m}(I)
   and DIV_n ≡ Div_n(I)

Fact 3.2.34. CS ⊢ ADIV_n, for every n.

Proof. Every set in every standard finite interpretation is a finite set, and all 
finite sets are roughly divisible by every n. □ 
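
Rough divisibility of a finite set is ordinary division with remainder. A minimal sketch, assuming the set is given as a Python list of atoms (the function name is hypothetical):

```python
def rough_division(atoms, n):
    """Split a finite list of atoms into n equal-size parts plus m < n leftover atoms."""
    size, m = divmod(len(atoms), n)
    parts = [atoms[i * size:(i + 1) * size] for i in range(n)]
    leftover = atoms[n * size:]
    return parts, leftover


parts, leftover = rough_division(list(range(17)), 5)
```

With 17 atoms and n = 5 this yields five parts of three atoms each and two leftover atoms, i.e. an instance of Mod_{5,2}.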

Fact 3.2.35. BASIC; ADIV_n ⊢:

a. ADIV_{n^m}, for all m

b. DIV_n, and

c. DIV_{n^m}, for all m.

Proof.

a. By induction on m: if m = 1, then n^m = n, so ADIV_n ⊢ ADIV_{n^m}.

If T ⊢ ADIV_{n^k}, A ⊨ T, and x ∈ A, then x can be partitioned into n^k
sets of the same size and i atoms, where i < n^k. Each non-atomic set in
the partition can be further partitioned into n sets of the same size and
j atoms, where j < n. Thus, we have partitioned x into n^k·n sets of the
same size and n^k·j + i atoms. But n^k·n = n^{k+1} and, since i < n^k and
j < n, n^k·j + i < n^{k+1}. Hence, A ⊨ ADIV_{n^{k+1}}.

b. Obvious. 

c. Immediate from (a) and (b). 

□ 
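
The bookkeeping in the induction step of (a) can be checked on finite cardinalities. This is a sketch for finite sets only, with hypothetical names; the proof itself works in arbitrary models of BASIC; ADIV_n.

```python
def refine(total, n, k):
    """Remainders i (at level n^k), j (refining each part by n) and n^k*j + i."""
    size, i = divmod(total, n ** k)  # n^k parts of `size` atoms each, i atoms left over
    _, j = divmod(size, n)           # each part splits into n parts, j atoms left over
    return i, j, n ** k * j + i


i, j, r = refine(100, 3, 2)  # 100 = 9*11 + 1 and 11 = 3*3 + 2
```

The combined remainder n^k·j + i is always the remainder of the total modulo n^{k+1}, which is the inequality the proof needs.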

Theorem 3.2.36. BDIV_N ⊬ ADIV_n for any n > 1.

Proof. If BDIV_N ⊢ ADIV_n, then by compactness there is a finite set J such that
BDIV_J ⊢ ADIV_n. But then BDIV_J ⊢ DIV_{n^k} for every k, by Fact 3.2.35c. But
this contradicts Corollary 3.2.32, which says that BDIV_J entails only finitely many
DIV_n sentences. □

So, even if we add all of the DIV sentences to BASIC, we are left with a 
theory weaker than CS. Since this weakness has arisen in the case of ADIV 
sentences, it is reasonable to attempt an axiomatization of CS as follows: 

Definition 3.2.37. CA = BASIC ∪ { ADIV_n | n > 0 }

The remainder of this chapter and the next two are devoted to showing that 
CA is, indeed, a complete set of axioms for CS. 

3.3 Remarks on showing that CA axiomatizes CS

We know that CS ⊢ CA and we want to show that CA ≡ CS, i.e. that CA ⊢ CS.
To do so, it will be sufficient to show that every consistent extension of CA is
consistent with CS.

Fact 3.3.1. 

a. (Lindenbaum's lemma) Every consistent theory has a consistent, complete 
extension. 

b. If every consistent, complete extension of T_2 is consistent with T_1, then
T_2 ⊢ T_1.

Proof.

a. [Monk, Theorem 11.13, p. 200]

b. Suppose that T_1 ⊢ φ, T_2 ⊬ φ. Then T = T_2; ¬φ is consistent. T has a
consistent, complete extension, T', by Lindenbaum's lemma. Since T_2 ⊆
T, T' is also a consistent, complete extension of T_2. But T' is not consistent
with T_1.

□ 

Definition 3.3.2. T' is a completion of T iff T' is a complete, consistent 
extension of T . 

To prove that every completion of CA is consistent with CS, we define two 
kinds of completions of a theory. 

Definition 3.3.3. 

a. T' is a finite completion of T iff T' is true in some finite model of T . 

b. T' is an infinite completion of T iff T' is true in some infinite model
of T.

Fact 3.3.4. If T' is a completion of T and BA ⊆ T, then (T' is a finite
completion of T iff T' is not an infinite completion of T.)

Proof.

(⇒) Suppose A is a finite model of T' with n atoms. Since T' is complete,
T' ⊢ EXACTLY_n. So T' ⊢ ¬ATLEAST_{n+1} and has no infinite models.

(⇐) T' ⊢ ATLEAST_n for every n, so T' has no finite models.

□ 

Fact 3.3.5. 

a. Every finite completion of CA is equivalent to CA; EXACTLY_n, for some
n.

b. Every finite completion of CA is consistent with CS.

Proof.

a. CA ⊢ BASIC and, by Theorem 2.1.6, BASIC is categorical in every finite
power.

b. The model A of CA; EXACTLY_n is a standard finite interpretation. So
A ⊨ CS.

□

Definition 3.3.6.

a. CAI = CA ∪ INF

b. CSI = CS ∪ INF

So, to show that every completion of CA is consistent with CS, we may now
concentrate on showing that every completion of CAI is consistent with CSI.

What, then, are the completions of CAI? Recall that CA entails DIV_n for
every n > 0, where DIV_n is

   MOD_{n,0} ∨ ... ∨ MOD_{n,n−1}

Any completion, T, of CAI has to solve the disjunction DIV_n for each n; T
has to entail one of the disjuncts. Remainder theories specify, for each n, the
number of atoms remaining when the universe is divided into n disjoint sets of
the same size.

Definition 3.3.7. Remainder functions and remainder theories

a. f : N+ → N is a remainder function iff 0 ≤ f(n) < n for all n ∈
Dom(f). (Henceforth f ranges over remainder functions.)

b. f is total iff Dom(f) = N+; otherwise f is partial.

c. f is finite iff Dom(f) is finite.

d. n is a solution for f iff for any i ∈ Dom(f), n ≡ f(i) mod i.

e. f is congruous iff for any i, j ∈ Dom(f), gcd(i, j) | f(i) − f(j);
otherwise, f is incongruous.

f. The remainder theory specified by f, RT_f, is the set of sentences { MOD_{n,m} | f(n) = m }.

g. If T is a theory, T_f = T ∪ RT_f.
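
For finite f, clauses (a), (d) and (e) are directly computable. The sketch below represents a finite remainder function as a Python dict from positive integers to remainders; this representation and the function names are our own assumptions, chosen only for illustration.

```python
from itertools import combinations
from math import gcd


def is_remainder_function(f):
    """Clause (a): 0 <= f(n) < n for every n in the (finite) domain."""
    return all(i > 0 and 0 <= r < i for i, r in f.items())


def is_solution(n, f):
    """Clause (d): n is congruent to f(i) mod i for every i in the domain."""
    return all(n % i == r for i, r in f.items())


def is_congruous(f):
    """Clause (e): gcd(i, j) divides f(i) - f(j) for all i, j in the domain."""
    return all((f[i] - f[j]) % gcd(i, j) == 0 for i, j in combinations(f, 2))


f = {4: 1, 6: 3}  # congruous: gcd(4, 6) = 2 divides 1 - 3
g = {4: 1, 6: 2}  # incongruous: 2 does not divide 1 - 2
```

For example, 9 is a solution for f, while g has no solution at all: a solution would have to be odd (remainder 1 mod 4) and even (remainder 2 mod 6) at once.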

Chapter 5 will show that if f is total, CAI_f is complete and that these are
the only complete extensions of CAI. In this section, we will show that CAI_f is
consistent just in case CSI_f is consistent.

Fact 3.3.8.

a. If f is finite, then f has a solution iff f is congruous iff f has infinitely
many solutions. [Griffin, Theorem 5-11, p. 80]

b. f is congruous iff every finite restriction of f is congruous.

c. There are congruous f without any solutions. (Let f(p) = p − 1, for all
primes p. Any solution would have to be larger than every prime.)
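
The example in (c) can be watched on finite restrictions. The brute-force search below is a sketch under our own conventions (a finite remainder function as a dict, a hypothetical helper `least_solution`); it is not the method of the cited theorem.

```python
from functools import reduce
from math import gcd


def least_solution(f):
    """Least n >= 0 with n % i == f[i] for all i in the domain, or None if there is none."""
    period = reduce(lambda a, b: a * b // gcd(a, b), f, 1)  # lcm of the domain
    for n in range(period):
        if all(n % i == r for i, r in f.items()):
            return n
    return None


small = least_solution({2: 1, 3: 2, 5: 4})         # f(p) = p - 1 on {2, 3, 5}
larger = least_solution({2: 1, 3: 2, 5: 4, 7: 6})  # f(p) = p - 1 on {2, 3, 5, 7}
```

Each finite restriction of f(p) = p − 1 has a solution, but the least one is the product of the primes involved minus one, so it grows without bound as more primes are added.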



Theorem 3.3.9.

a. If f is finite, then CS_f is consistent iff f is congruous.

b. CS_f is consistent iff f is congruous.

c. CSI_f is consistent iff f is congruous.

Proof.

a. (⇒) Let φ = ⋀RT_f. Since CS; φ is consistent, there is some n such that
A_n ⊨ φ. So n is a solution of f and, hence, f is congruous by 3.3.8a.

(⇐) If f is congruous, f has a solution, n. So A_n ⊨ CS; φ.

b. (⇒) For every finite restriction, g, of f, CS_g is consistent. By (a), each
such g is congruous. Hence, f is congruous by 3.3.8b.

(⇐) Every finite restriction, g, of f is congruous. So CS_g is consistent, by
(a). By compactness, then, CS_f is consistent.

c. (⇒) If CSI_f is consistent, so is CS_f. So, by (b), f is congruous.

(⇐) By compactness, it is sufficient to show that every finite subtheory, T,
of CSI_f is consistent. But, if T is such a theory, then

   T ⊆ CS_g ∪ { ATLEAST_i | i < n }

for some n and some finite restriction, g, of f. Since f is congruous, g is
as well, by 3.3.8b. So g has arbitrarily large solutions and CS_g has finite
models large enough to satisfy T. Hence, T is consistent.

□ 

We now want to prove a similar theorem for CA, our proposed axiomatization 
of CS. To do this, we must first establish that certain sentences are theorems 
of CA. 

Lemma 3.3.10. If n | m, 0 ≤ q < n, and p ≡ q mod n, then

   CA ⊢ MOD_{m,p} → MOD_{n,q}.

Proof. Suppose A ⊨ MOD_{m,p}, k_1 = m/n, and p = k_2·n + q. So, B(A) can be
partitioned into m sets of the same size

   b_{1,1}    ...  b_{1,n}
   b_{2,1}    ...  b_{2,n}
   ...
   b_{k_1,1}  ...  b_{k_1,n}

and p atoms

   a_{1,1}    ...  a_{1,n}
   a_{2,1}    ...  a_{2,n}
   ...
   a_{k_2,1}  ...  a_{k_2,n}
   c_1  ...  c_q

For 1 ≤ i ≤ n, let

   B_i = ⋃_{1≤j≤k_1} b_{j,i} ∪ ⋃_{1≤j≤k_2} a_{j,i}

Since A ⊨ DISJ∪, A ⊨ B_i ~ B_j, for 1 ≤ i, j ≤ n. Furthermore,

   I = ⋃_{1≤i≤n} B_i ∪ c_1 ∪ ... ∪ c_q

So, A ⊨ MOD_{n,q}. □
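
In the standard finite interpretations the lemma reduces to a familiar fact about remainders. The following sketch covers only that finite case, with a hypothetical helper name:

```python
def coarser_remainder(m, p, n):
    """Remainder q with MOD_{n,q}, given MOD_{m,p} and n | m (finite case only)."""
    if m % n != 0:
        raise ValueError("n must divide m")
    return p % n


# A universe of 41 atoms: 41 = 12*3 + 5, so MOD_{12,5} holds, and 5 = 4*1 + 1.
q = coarser_remainder(12, 41 % 12, 4)
```

Regrouping the twelve equal parts into four columns of three, and the five leftover atoms into one row of four plus one, gives MOD_{4,1}, in agreement with 41 mod 4.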

Lemma 3.3.11. If 0 ≤ p < q < m, then

   CA ⊢ MOD_{m,p} → ¬MOD_{m,q}

Proof. Suppose A ⊨ MOD_{m,p} ∧ MOD_{m,q}. Then:

   A ⊨ x_1 ∪ ... ∪ x_m ∪ a_1 ∪ ... ∪ a_p = I

   A ⊨ y_1 ∪ ... ∪ y_m ∪ b_1 ∪ ... ∪ b_q = I

where the a's and b's are atoms and the x's (y's) are disjoint sets of the same
size in A. Let

   X  = x_1 ∪ ... ∪ x_m
   Y  = y_1 ∪ ... ∪ y_m
   A  = a_1 ∪ ... ∪ a_p
   B  = b_1 ∪ ... ∪ b_p
   B' = b_{p+1} ∪ ... ∪ b_q

We claim that y_1 < x_1. For, if y_1 ~ x_1, then X ∪ A ~ Y ∪ B and if x_1 < y_1,
then X ∪ A < Y ∪ B; neither is possible since Y ∪ B ⊂ I = X ∪ A. So y_i < x_i
for 1 ≤ i ≤ m. Since A ⊨ REP<, there is a proper subset y'_i of x_i which is the
same size as y_i.

   Let z_i = x_i − y'_i,
   Z = z_1 ∪ ... ∪ z_m
   and Y' = y'_1 ∪ ... ∪ y'_m

   So Y' ∪ Z = X = I − A
   and Y ∪ B' = I − B.

But A ~ B, since each is the disjoint union of p atoms. Thus (I − A) ~
(I − B), by RC~, so Y' ∪ Z ~ Y ∪ B'. But Y' ~ Y, since for each component
of Y, there is a component of Y' of the same size. So Z ~ B', by RC~.

But Z must be larger than B', for B' is the union of fewer than m atoms
while Z is the union of m non-empty sets. So, the original supposition entails a
contradiction.

□

Theorem 3.3.12.

a. If f is finite, then CA_f is consistent iff f is congruous.

b. CA_f is consistent iff f is congruous.

c. CAI_f is consistent iff f is congruous.

Proof.

a. (⇒) Supposing that f is incongruous, there exist i, j, and k such that
k = gcd(i, j) and k ∤ (f(i) − f(j)). We will show that

   (*) CA ⊢ ¬(MOD_{i,f(i)} ∧ MOD_{j,f(j)})

from which it follows that CA_f is inconsistent.
Let p and q be such that

   0 ≤ p, q < k,
   f(i) ≡ p mod k, and
   f(j) ≡ q mod k

By lemma 3.3.10, we have

   (1) CA ⊢ MOD_{i,f(i)} → MOD_{k,p}, and

   (2) CA ⊢ MOD_{j,f(j)} → MOD_{k,q}

since k | i and k | j. Since k ∤ f(i) − f(j), p ≠ q. So lemma 3.3.11 yields

   (3) CA ⊢ MOD_{k,p} → ¬MOD_{k,q}

From (1), (2), and (3) we may conclude (*).

(⇐) Follows from 3.3.9a since CA_f ⊆ CS_f.

b. See proof of 3.3.9b.

c. See proof of 3.3.9c.

□ 

Corollary 3.3.13. CAI_f is consistent iff CSI_f is consistent.

Proof. Immediate from 3.3.9c and 3.3.12c. □ 

It might help to review our strategy before presenting the difficult parts of
the proof that CA ≡ CS. The main objective is (1), which follows from (2) by
3.3.1b.

(1) CA ⊢ CS

(2) Every completion of CA is consistent with CS.

We already know that the finite completions of CA are consistent with CS
(see 3.3.5b) and that if CAI_f is consistent, then CSI_f is also consistent (see
3.3.13). So (2) is a consequence of (3).

(3) If T is a completion of CAI, then T ≡ CAI_f for some total,
congruous f.

To establish (3), it is sufficient to prove (4) because every completion of CAI
entails CAI_f for some total f.

(4) If f is total and congruous, then CAI_f is complete.

To prove (4), we invoke the prime model test: If T is model complete and T
has a prime model, then T is complete (see Appendix, Fact C.3). So (4) follows
from (5) and (6).

(5) If f is total and congruous, then CAI_f has a prime model.

(6) For any f, CAI_f is model complete.

Finally, since any extension of a model complete theory is also model
complete (see Appendix, Fact C.7a), we can infer (6) from (7).

(7) CAI is model complete.

So (1) follows from (5) and (7).

The proof outlined here will be carried out in Chapter 5. But first, we
consider a simpler theory, PSIZE, which deals only with sizes of sets and ignores
boolean relations. Chapter 4 formulates PSIZE and establishes that it is model
complete, a result we need for showing that CAI is model complete.

The Pure Theory of Class Sizes

CS is about sets; it makes claims about sets in terms of their boolean relations 
and their size relations. In this chapter, we identify a theory, PCS (the pure 
theory of class sizes), which is not about sets, but only about sizes of sets. Sizes, 
here, are equivalence classes of sets which have the same size. PCS is formulated 
in the language of size relations, L_<.

PCS is worth examining in its own right, for if number theory is the theory 
of cardinal numbers, then PCS is our version of number theory. But our main 
reason for introducing PCS is to aid the proof that CA is model complete. For 
this reason, we only give a sketchy treatment of PCS itself. 

Section 1 defines PCS and develops a set of axioms, PCA, for it, as follows:
for each model, A, of BASIC, the size model, S_A, consists of equivalence
classes drawn from A under the same size relation; PCS is the set of statements
true in S_A for any standard finite model, A. PCA consists of a theory, PSIZE,
which holds in S_A whenever A ⊨ BASIC, and a set of divisibility principles.

Using some results about model theory in section 2, Section 3 establishes
that PCA is model complete, the main result of this chapter and the only result
needed for subsequent proofs. This is done by reducing PCA to the theory Zgm,
whose models are Z-groups taken modulo some specific element.

Finally, section 4 indicates how PCA could be shown to axiomatize PCS.
This method is the same as the one outlined in chapter 3 to show that CA ≡ CS.

4.1 Size models and PCS 

Definition 4.1.1. Suppose A ⊨ BASIC.

a. If x is a member of A, then σ_A[x] is the size of x in A:

   σ_A[x] = { y | A ⊨ (x ~ y) }

b. S_A, the size model for A, is the interpretation of L_< whose domain is
{ σ_A[x] | x ∈ A } where

   S_A ⊨ σ_A[x] ~ σ_A[y]             iff A ⊨ x ~ y
   S_A ⊨ σ_A[x] < σ_A[y]             iff A ⊨ x < y
   S_A ⊨ Sum(σ_A[x], σ_A[y], σ_A[z])  iff A ⊨ Sum(x, y, z), and
   S_A ⊨ Unit(σ_A[x])                iff A ⊨ Atom(x)

c. The one-place operator x̄ is to be read as the complementary size of x:

   S_A ⊨ (y = x̄) iff S_A ⊨ Sum(x, y, I)

These interpretations are well defined because the predicates are satisfied
by elements of A in virtue of their sizes. For example, if A ⊨ Sum(x, y, z) and
A ⊨ z ~ z', then A ⊨ Sum(x, y, z').

Definition 4.1.2. PCS, the pure theory of class sizes, consists of all
sentences of L_< which are true in the size model of every standard finite
interpretation of L_{C<}.



Definition 4.1.3. PSIZE, the theory of sizes, consists of the following
axioms:

Order axioms

   (IRREF<)   ¬(x < x)
   (TRANS<)   x < y ∧ y < z → x < z
   (UNIQ~)    x ~ y ↔ x = y
   (MAX)      x ≤ I
   (MIN)      0 ≤ x
   (TRICH)    x < y ∨ x ~ y ∨ y < x

Unit axioms

   Unit(x) ↔ ∀y(y < x ↔ y = 0)
   ∃x Unit(x)

Sum axioms

   (IDENT)    Sum(x, 0, x)
   (COMM)     Sum(x, y, z) ↔ Sum(y, x, z)
   (MONOT)    Sum(x_1, y, z_1) ∧ Sum(x_2, y, z_2) → (x_1 < x_2 ↔ z_1 < z_2)
   (ASSOC)    Sum(x, y, w_1) ∧ Sum(w_1, z, w) ∧ Sum(y, z, w_2) → Sum(x, w_2, w)
   (EXIST+)   ∃z Sum(x, y_1, z) ∧ y_2 ≤ y_1 → ∃z Sum(x, y_2, z)
   (EXIST−)   x ≤ z → ∃y Sum(x, y, z)
   (COMP)     Sum(x, x̄, I)



Fact 4.1.4. If A ⊨ BASIC, then S_A ⊨ PSIZE
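
For a standard finite interpretation with n atoms, the sizes can be taken to be 0, ..., n, with Sum(i, j, k) holding iff i + j = k ≤ n. Under that assumption some of the Sum axioms can be checked by brute force. The sketch below is ours, not the text's: it reads COMP as Sum(x, n − x, n), and it also tests the rival reading of Sum as addition modulo n + 1.

```python
from itertools import product


def standard_sum(n):
    """Sum(i, j, k) iff i + j = k <= n, on the sizes 0..n."""
    return lambda i, j, k: i + j == k and k <= n


def modular_sum(n):
    """The rival reading: Sum(i, j, k) iff i + j = k mod (n + 1)."""
    return lambda i, j, k: (i + j) % (n + 1) == k


def check_sum_axioms(n, Sum):
    """Brute-force IDENT, COMM, MONOT and COMP on the sizes 0..n, with I = n."""
    D = range(n + 1)
    return {
        "IDENT": all(Sum(x, 0, x) for x in D),
        "COMM": all(Sum(x, y, z) == Sum(y, x, z)
                    for x, y, z in product(D, repeat=3)),
        "MONOT": all((x1 < x2) == (z1 < z2)
                     for x1, x2, y, z1, z2 in product(D, repeat=5)
                     if Sum(x1, y, z1) and Sum(x2, y, z2)),
        "COMP": all(Sum(x, n - x, n) for x in D),
    }


standard = check_sum_axioms(4, standard_sum(4))
modular = check_sum_axioms(4, modular_sum(4))
```

The standard reading passes all four checks. The modular reading passes IDENT, COMM and COMP but fails MONOT: with n = 4, adding 4 to 0 gives 4 while adding 4 to 1 wraps around to 0.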



Fact 4.1.5. PSIZE ⊬ PCS.

Proof. PSIZE fails to axiomatize PCS for the same reason that BASIC fails to 
axiomatize CS: the lack of divisibility principles. □ 



We offer PCA as an axiomatic version of PCS:

Definition 4.1.6. PCA, the pure class size axioms, is defined as:

   PSIZE ∪ { ADIV_n | n > 0 }

Fact 4.1.7. If A ⊨ CA, then S_A ⊨ PCA

If A is a standard finite interpretation of BASIC with n atoms, then the
elements of S_A can be regarded as the sequence 0, ..., n with the usual ordering,
where

   S_A ⊨ Unit(x) iff x = 1

and

   S_A ⊨ Sum(i, j, k) iff (i + j) = k ≤ n

Once n is fixed, this is the only interpretation allowed by the axioms PSIZE. In
particular, MONOT rules out the interpretation in which Sum(i, j, k) is satisfied
just in case (i + j) ≡ k mod (n + 1).

4.2 Some models and theories

We shall use Theorem 4.2.1a to show that PCA is model complete and Theorem 
4.2.1b to show, in chapter 5, that CAI is model complete. 

Theorem 4.2.1. 

a. If T satisfies Monk's Condition, then T is model complete. 

Monk's Condition. If A ⊨ T, B ⊨ T, A ⊆ B, and C is a
finitely generated submodel of B, then there is an isomorphism,
f : C → A such that if x ∈ C ∩ A, then f(x) = x. f is a Monk
mapping.

b. If T is model complete and L_T has no function symbols, then T satisfies
Monk's Condition.

Proof.

a. [Monk, p. 359]

b. If L_T has no function symbols, then any finitely generated structure over
L_T is finite. So, suppose C contains a_1, ..., a_n (from A) and b_1, ..., b_m
(from B − A). Let φ_1 be the diagram of C and obtain φ_2 from φ_1 by
substituting the variable x_i for each constant a_i and the variable y_i for each
constant b_i. Finally, let φ_3 be

   ∃y_1 ... ∃y_m φ_2

So, φ_3 is a primitive formula. B ⊨ φ_3(a_1, ..., a_n), so A does as well, by
Fact C.5d in the Appendix. So, to obtain the desired isomorphism, map
the a_i's into themselves and map the b_i's into a sequence of elements of A
which can stand in for the existentially quantified variables of φ_3.

□ 



In chapter 5, we use Monk's Theorem to infer the model completeness of the 
theory CA from that of PCA. To establish the model completeness of PCA, we 
use Fact 4.2.2. 



Fact 4.2.2. 



a. If T_1 is model complete and T_2 ⊢ T_1, then T_2 is also model complete.

b. If T is model complete in L, and L' is an expansion of L by adjoining new
individual constants, then T is model complete in L'. [Monk, p. 355]



Definition 4.2.3. Suppose L_1 and L_2 are first order languages and L_12 = L_1 −
L_2. A translation (simple translation) of L_1 into L_2 is a function, τ, which:

a. assigns to the universal quantifier a (quantifier free) formula, τ_∀, of L_2
with exactly one free variable.

b. assigns to each n-place predicate, P, in L_12 a (quantifier free) formula,
τ_P, of L_2 with exactly n free variables, and

c. assigns to each n-place function symbol, O, in L_12 a (quantifier free)
formula, τ_O, of L_2 with exactly (n + 1) free variables.

Definition 4.2.4. If τ is a translation of L_1 into L_2, then τ extends to all
formulae of L_1 as follows:

a. Predicates and function symbols of L_2 are translated into themselves.

b.

   τ(φ ∨ ψ) = τ(φ) ∨ τ(ψ)
   τ(φ ∧ ψ) = τ(φ) ∧ τ(ψ)
   τ(¬φ) = ¬τ(φ)
   τ(∀xφ) = ∀x(τ_∀(x) → τ(φ))
   τ(∃xφ) = ∃x(τ_∀(x) ∧ τ(φ))



Definition 4.2.5. If τ is a translation of L_1 into L_2, then

a. The functional assumptions of τ are the sentences:

   ∀x_1 ... ∀x_n ((τ_∀(x_1) ∧ ... ∧ τ_∀(x_n)) →
      ∃y_1(τ_∀(y_1) ∧ ∀y_2(τ_O(x_1, ..., x_n, y_2) ↔ y_1 = y_2)))

where O is a function symbol in L_1 but not in L_2.

b. The existential assumption of τ is

   ∃x τ_∀(x)

The functional assumptions of a translation say that the formulas which
translate function symbols yield unique values within the relevant part of the
domain when given values in the relevant part of the domain. The relevant
part of the domain is the set of elements which satisfy the interpretation of
the universal quantifier. The existential assumption of a translation says that
that subdomain is non-empty. Notice that the existential and functional
assumptions of a translation are sentences of L_2.

A translation from L_1 into L_2 induces a mapping from interpretations of L_2
into interpretations of L_1.

Definition 4.2.6. If τ is a translation from L_1 into L_2 and B is an
interpretation of L_2 which satisfies the existential and functional assumptions
of τ, then τ(B) is the interpretation A of L_1 such that:

a. The domain of A is the set of elements of B which satisfy τ_∀.

b. A interprets all predicates and function symbols common to L_1 and L_2 in
the same way that B does.

c. A interprets all predicates and function symbols in L_12 in accordance with
the translations assigned by τ:

   A ⊨ P(x)      iff B ⊨ τ_P(x)
   A ⊨ y = O(x)  iff B ⊨ τ_O(x, y)

The following condition on theories allows us to infer the model completeness 
of one from the model completeness of the other. 

Definition 4.2.7. 

a. If τ is a translation from L_1 to L_2, then T_1 is τ-reducible to T_2 iff for
every model A of T_1 there is a model B of T_2 such that A = τ(B).

b. T_1 is reducible (simply reducible) to T_2 iff there is a (simple)
translation, τ, for which T_1 is τ-reducible to T_2.

c. T_1 is uniformly τ-reducible to T_2 iff for any models, A_1 and B_1, such
that A_1 ⊨ T_1 and B_1 ⊨ T_1, and A_1 ⊆ B_1, there exist models A_2 and B_2
such that A_2 ⊨ T_2 and B_2 ⊨ T_2, and A_2 ⊆ B_2, A_1 = τ(A_2) and B_1 = τ(B_2).

Lemma 4.2.8. Suppose that T_1 is τ-reducible to T_2 and that A_1 = τ(A_2).
Then, for any primitive formula, φ, of L_1 and any sequence, x ∈ A_1,

   A_1 ⊨ φ(x) iff A_2 ⊨ τ(φ)(x)

Proof. Suppose

   φ(x) = ∃y_1 ... ∃y_n φ'(x, y_1, ..., y_n)

where φ'(x, y_1, ..., y_n) is a conjunction of atomic formulae and negations of
atomic formulae. Then

   A_1 ⊨ φ(x) iff A_1 ⊨ ∃y_1 ... ∃y_n φ'(x, y_1, ..., y_n)
              iff A_1 ⊨ φ'(x, b_1, ..., b_n), for some b_i ∈ A_1
              iff A_2 ⊨ τ(φ')(x, b_1, ..., b_n)
              iff A_2 ⊨ ∃y_1 ... ∃y_n
                     (τ_∀(y_1) ∧ ... ∧ τ_∀(y_n) ∧ τ(φ')(x, y_1, ..., y_n))
              iff A_2 ⊨ τ(φ)(x)

□

Theorem 4.2.9. If T_2 is model complete, τ is a simple translation from L_{T_1}
to L_{T_2}, and T_1 is uniformly τ-reducible to T_2, then T_1 is also model complete.

Proof. By Fact C.5d in the Appendix, it is enough to show that given models
A_1 and B_1 of T_1, where A_1 ⊆ B_1, and a primitive formula, φ:

   If B_1 ⊨ φ(x) for x ∈ A_1
   then A_1 ⊨ φ(x)

Since T_1 is uniformly τ-reducible to T_2, there are models of T_2, A_2 ⊆ B_2, where
A_1 = τ(A_2) and B_1 = τ(B_2).

   Since B_1 ⊨ φ(x)        by assumption
   then B_2 ⊨ τ(φ)(x)      by lemma 4.2.8
   So A_2 ⊨ τ(φ)(x)        since T_2 is model complete
   and A_1 ⊨ φ(x)          by lemma 4.2.8

□ 

We shall now define several theories, all more or less familiar, which will 
serve in showing that our theory of size is model complete. 

Definition 4.2.10.

a. The theory of abelian groups with identity has the following axioms:

   (1) x + (y + z) = (x + y) + z
   (2) x + y = y + x
   (3) x + 0 = x
   (4) ∃y(x + y = 0)

b. The theory of cancellable abelian semigroups with identity consists
of (1), (2), and (3) above and:

   (4') x + y = x + z → y = z

c. The axioms of simple order are:

   (5) x ≤ y ∧ y ≤ z → x ≤ z
   (6) x ≤ y ∧ y ≤ x → x = y
   (7) x ≤ x
   (8) x ≤ y ∨ y ≤ x

d. The theory of Z-groups, Zg, has the following axioms:

   (i) The axioms for abelian groups with identity,
   (ii) The axioms for simple order,
   (iii) The following additional axioms:

      (9) y ≤ z → x + y ≤ x + z
      (10) 1 is the least element greater than 0
      (11) ∀x∃y(ny = x ∨ ... ∨ ny = x + (n − 1))
           for each positive n, where ny stands for y + ... + y (n times)

e. The theory of N-semigroups has the following axioms:

   (i) The axioms for cancellable abelian semigroups with identity,
   (ii) The axioms of simple order,
   (iii) Axioms (9) and (10) of Zg, and
   (iv) The additional axiom:

      (12) 0 ≤ x

f. The theory of Z-groups modulo I, Zgm, consists of the following
axioms:

   (i) The axioms for abelian groups with identity,
   (ii) The axioms for simple order,
   (iii) Axioms (10) and (11) of Zg, axiom (12) from the theory of N-semigroups,
   and
   (iv) The following additional axioms:

      (13) (y ≤ z ∧ x ≤ x + z) → x + y ≤ x + z
      (14) x ≤ I

The theory of Z-groups is taken from [Chang, p. 291].
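
To see why Zgm replaces axiom (9) by the guarded axiom (13), it helps to look at a small finite structure. The sketch below assumes that {0, ..., I} with addition taken modulo I + 1 and the usual order is among the intended models, and it reads the order symbol as ≤; both are our assumptions, and the function names are hypothetical. Axiom (11) is checked only for n up to a small bound.

```python
from itertools import product


def zgm_axioms_hold(I, max_n=4):
    """Check axioms (11)-(14) on {0, ..., I} with addition modulo I + 1."""
    M = I + 1
    D = range(M)
    ax11 = all(any((n * y) % M == (x + r) % M for y in D for r in range(n))
               for n in range(1, max_n + 1) for x in D)
    ax12_and_14 = all(0 <= x <= I for x in D)
    ax13 = all((x + y) % M <= (x + z) % M
               for x, y, z in product(D, repeat=3)
               if y <= z and x <= (x + z) % M)
    return ax11 and ax12_and_14 and ax13


def axiom9_holds(I):
    """Axiom (9) of Zg, read with the same modular addition."""
    M = I + 1
    return all((x + y) % M <= (x + z) % M
               for x, y, z in product(range(M), repeat=3) if y <= z)
```

With I = 6 the guarded axiom (13) holds, whereas the unguarded axiom (9) fails as soon as a sum wraps around (take x = 6, y = 0, z = 1).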

Fact 4.2.11.

a. Zg is the complete theory of (Z, +, 0, 1, ≤). [Chang, p. 291]

b. Zg is model complete. [Robinson]



Theorem 4.2.12.

a. The theory of N-semigroups is model complete.

b. The theory of Z-groups modulo I is model complete.

Proof.

a. Every abelian semigroup with cancellation can be isomorphically embedded
in an abelian group [Kurosh, pp. 44-48]. It is clear from the construction
in [Kurosh] that if the semigroup is ordered, the abelian group
in which it is embedded may also be ordered and that the elements of
the semigroup will be the positive elements of the group. Moreover, the
(rough) divisibility of the elements in the semigroup will be carried over
to the group.

Consequently, the theory of N-semigroups is uniformly reducible to the
theory of Z-groups by the translation:

   τ_∀ = ⌜0 ≤ x⌝

Since the latter is model complete, so is the former, by 4.2.9.

b. First, consider the theory of N-semigroups in the language which contains,
besides the constant symbols in the original theory, an individual constant,
I. The theory of N-semigroups is model complete in this language, by
4.2.2b.

We claim that the theory of Z-groups modulo I is uniformly reducible to
this new theory by the following translation:

   τ_∀ = ⌜x ≤ I⌝

   τ_+ = ⌜x + y = z ∨ x + y = I + z⌝

(The construction: given a model of Zgm, stack up ω many copies of the
model, assigning interpretations in the obvious way. The result is an
N-semigroup and the original model is isomorphic to the first copy of itself.)

□ 

In the next section, we use Theorem 4.2.9 to show that PSIZE is model 
complete, by reducing it to the theory of Z-groups modulo /. Chapter 5 uses 
Monk's Theorem to show that CA is model complete. 



4.3 PCA is model complete 

To show that PCA is model complete, we shall reduce it to Zgm, the theory of
Z-groups with addition taken modulo some constant (see 4.2.10). The model
completeness of PCA then follows from the model completeness of Zgm (Theorem
4.2.12) and Theorem 4.2.9. Specifically, we shall show that every model of PCA
is the τ-image of a model of Zgm, where τ is the following translation.

Definition 4.3.1. Let τ be the translation from L_PSIZE to L_Zgm where:

   τ_∀          = ⌜x = x⌝
   τ_Sum(x,y,z) = ⌜x + y = z ∧ x ≤ z⌝
   τ_Unit(x)    = ⌜x = 1⌝
   τ_x̄          = ⌜x + y = I⌝
   τ_x<y        = ⌜x ≤ y ∧ x ≠ y⌝
   τ_0          = ⌜x = 0⌝

Given a model, A, of PCA, we can construct a model, Z_A, of Zgm directly:

Definition 4.3.2. If A ⊨ PSIZE, then Z_A is the interpretation of L_Zgm in
which:

   (a) The domain of Z_A is the domain of A

   (b) Z_A ⊨ (x ≤ y) iff A ⊨ (x < y ∨ x = y)

   (c) Z_A(I) = A(I)

   (d) Z_A(0) = A(0)

   (e) Z_A ⊨ (x + y = z) iff A ⊨ Sum(x, y, z)
       or A ⊨ ∃a∃w(Unit(a) ∧ Sum(x̄, ȳ, w) ∧ Sum(w, a, z̄))

   (f) Z_A ⊨ (x = 1) iff A ⊨ Unit(x)
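
Clause (e) can be tested on the size model of a standard finite interpretation, where the sizes are 0, ..., n, Sum(i, j, k) holds iff i + j = k ≤ n, the unit is 1, and the complementary size of x is n − x. The sketch below rests on those assumptions and on our reading of clause (e) with complements on x, y and z; the function names are hypothetical.

```python
def z_addition(n):
    """Return plus(x, y): all z with Z_A |= x + y = z, for the size model 0..n."""
    D = range(n + 1)

    def Sum(i, j, k):  # Sum in the standard finite size model
        return i + j == k and k <= n

    def bar(x):        # complementary size: Sum(x, bar(x), I) with I = n
        return n - x

    def plus(x, y):
        return [z for z in D
                if Sum(x, y, z)
                or any(Sum(bar(x), bar(y), w) and Sum(w, 1, bar(z)) for w in D)]

    return plus


plus = z_addition(5)
```

For every pair x, y the list returned by `plus` has exactly one member, namely (x + y) mod (n + 1). So, on this reading, the two disjuncts of clause (e) together define addition modulo the successor of I, and they never both apply.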



Fact 4.3.3j establishes that 4.3.2 gives Sum a functional interpretation.
Theorem 4.3.8 establishes that Z_A ⊨ Zgm, on the basis of the intervening facts: 4.3.3
deals with the model A of PSIZE; 4.3.4 deals with the corresponding model Z_A;
4.3.6 verifies some connections between A and Z_A.



Fact 4.3.3. The following are theorems of PSIZE. (Sm(x, y) abbreviates
∃z Sum(x, y, z).)

   (a) Sum(x, y, z_1) ∧ Sum(x, y, z_2) → z_1 = z_2             (UNIQ+)

   (b) Sum(x, y_1, z) ∧ Sum(x, y_2, z) → y_1 = y_2             (UNIQ−)

   (c) Sum(x, y, w_1) ∧ Sum(w_1, z, w) →                       (ASSOC2)
          ∃w_2(Sum(y, z, w_2) ∧ Sum(x, w_2, w))

   (d) x̄̄ = x (the complement of the complement of x is x)

   (e) x < y ↔ ȳ < x̄

   (f) x < y ∧ y < ȳ → x < x̄

   (g) Sm(x, y) ∨ Sm(x̄, ȳ)

   (h1) Sum(x, y, z) → Sum(z̄, y, x̄)

   (h2) Sum(x, y, z) → Sum(x, z̄, ȳ)

   (i) Sum(x, y, z_1) ∧ Sum(x̄, ȳ, z_2) → (z_1 = z_2 = I)

   (j) ∃z_1 Sum(x, y, z_1) ↔
          ¬∃a∃w∃z_2(Unit(a) ∧ Sum(x̄, ȳ, w) ∧ Sum(w, a, z̄_2))



Proof. The proofs are elementary. □ 

In addition to the theorems of PSIZE listed in 4.3.3, we require a battery of
tedious facts about the model Z_A. We shall state these in terms of the model
Z'_A, an expansion of both A and Z_A, which interprets two additional operators,
as follows:

Definition 4.3.4. Given a model A of PSIZE, Z'_A is the expansion of Z_A
induced by the definitions in 4.3.2 together with (a) and (b):

   (a) Z'_A ⊨ y = −x iff Z_A ⊨ x + y = 0

   (b) Z'_A ⊨ z = x − y iff Z'_A ⊨ z = x + −y

Remark. Given a model A of PSIZE: for each x in Z_A, there is a unique y
such that:

   Z_A ⊨ x + y = 0

Proof.

a. If x = 0, then Z_A ⊨ x + y = 0 iff y = 0.

   (⇒)
   If Z_A ⊨ 0 + y = 0
   then A ⊨ Sum(0, y, 0)           by 4.3.2e
   But A ⊨ Sum(0, 0, 0)            by IDENT
   So Z_A ⊨ y = 0                  by UNIQ−

   (⇐) Obvious

b. If x > 0, there is a unique y which satisfies

   (*) Sum(x̄, a, y), where Unit(a)

   since there is a unique unit and A ⊨ UNIQ+.
   But Z_A ⊨ x + y = 0 iff A ⊨ (*).

   (⇒)
   If Z_A ⊨ x + y = 0
   then A ⊨ Sum(x̄, ȳ, w)
   and A ⊨ Sum(w, a, 0̄)            since A ⊭ Sum(x, y, 0)
   But A ⊨ 0̄ = I                   by COMP and UNIQ−
   So A ⊨ w = ā
   So A ⊨ Sum(x̄, ȳ, ā)
   So A ⊨ Sum(x̄, a, y)             by 4.3.3d and h2

   (⇐)
   If A ⊨ (*)
   then A ⊨ Sum(x̄, ȳ, ā)           by 4.3.3h2
   and A ⊨ Sum(ā, a, 0̄)            by COMP
   So Z_A ⊨ x + y = 0              by 4.3.2e

□



Fact 4.3.5. Z_A satisfies the following:

(a) −(x + y) = −x + −y
(b) −(x − y) = y − x
(c) (x + y) − y = x
(d) −(−x) = x
(e) x ≠ 0 ∧ x < y → −y < −x

Proof. Omitted. □



Fact 4.3.6. Z_A satisfies the following:

a. Sm(a, b) ↔ Sum(a, b, a + b)

b. c < b ↔ Sum(c, b − c, b)

c. Sum(a, b, c) ∧ 0 < b → Sum(a, −c, −b)
Sum(a, b, c) ∧ 0 < a → Sum(−c, b, −a)

d. ¬Sm(a, b) → (Sm(−a, −b) ∨ b = −a)
Proof.

a. (⇒)

Suppose Sm(a, b)
So Sum(a, b, w) for some w
So (a + b) = w    by 4.3.2e
So Sum(a, b, a + b)

(⇐) Obvious.

b. (⇒)

Suppose c < b
then Sum(c, w, b) for some w    by EXIST−
So b = c + w    by (a) and UNIQ+
So b − c = (c + w) − c
So b − c = w    by 4.3.5c
So Sum(c, b − c, b)

(⇐)

Suppose Sum(c, b − c, b)
But Sum(c, 0, c)    by IDENT
So 0 < (b − c) ↔ c < b    by MONOT and UNIQ+
So c < b    by MIN



c.

Suppose Sum(a, b, c)
and 0 < b
then c = (a + b)    by (a) and UNIQ+
So c − b = (a + b) − b
So c − b = a    by 4.3.5c
So −b = a − c    by 4.3.5c
So Sum(a, −c, −b) if Sm(a, −c)    by (a)
But if a = 0
then Sm(a, −c)    by IDENT
And if a ≠ 0
then a < c, since 0 < b    by MONOT and UNIQ+
So −c < −a    by 4.3.5e
So −c < a    by 4.3.4
But Sm(a, a)    by IDENT
So Sm(a, −c)    by EXIST+



d.

Suppose ¬Sm(a, b)
But b = (a + b) − a,    by 4.3.5c
So ¬Sum(a, (a + b) − a, a + b),    by (a)
So ¬(a < a + b),    by (b)
So (a + b) < a,    by TRICH
But if (a + b) ≠ 0
then −a < −(a + b),    by 4.3.5e
So −a < −a + −b,    by 4.3.5a
So Sum(−a, (−a + −b) − (−a), −a + −b),    by (b)
But (−a + −b) − (−a) = −b,    by 4.3.5c
So Sum(−a, −b, −a + −b)
So Sm(−a, −b)
And if (a + b) = 0
then b = −a

□

We can now show that Z_A ⊨ Zgm. The only real difficulty arises in verifying
that addition is associative. We need the following lemma.

Lemma 4.3.7. Z_A ⊨ (a + c) + (b − c) = a + b

Proof. We shall work through successively more general cases: 

a. When Sm(a, b) and c < b

We know Sum(b − c, c, b)    by 4.3.6b
and Sum(b, a, a + b)    by 4.3.6a
So ∃w(Sum(c, a, w) ∧ Sum(b − c, w, a + b))    by ASSOC2
So w = c + a    by 4.3.6a and UNIQ+
and a + b = (b − c) + (c + a)    by 4.3.6a and UNIQ+
So a + b = (a + c) + (b − c)

b. When Sm(a, b) and Sm(a, c)

If c < b, Case (a) applies directly
So assume b < c
then a + c = (a + b) + (c − b)    by Case (a)
So (a + c) − (c − b) = a + b    by 4.3.5c
So (a + c) + −(c − b) = a + b    by 4.3.4b
So (a + c) + (b − c) = a + b    by 4.3.5b

c. When Sm(a, b)

If Sm(a, c), Case (b) applies
So assume ¬Sm(a, c) and thus b < c    by EXIST+
So Sum(b, c − b, c)    by 4.3.6b
and 0 < c − b    by MONOT
So Sum(b, −c, −(c − b))    by 4.3.6c
So Sum(b, −c, b − c)    by 4.3.5b
So Sum(−c, b, b − c)    (c1)

Either Sm(−a, −c) or c = −a, since ¬Sm(a, c)    by 4.3.6d

Assume Sm(−a, −c)
then Sum(−a, −c, −a + −c)    by 4.3.6a
So Sum(−a, −c, −(a + c))    by 4.3.5a
and 0 < −a
So Sum(−(−(a + c)), −c, −(−a))    by 4.3.6c
So Sum(a + c, −c, a)    by 4.3.5d    (c2)

Assume c = −a
then a + c = 0
and a = −c
Hence Sum(a + c, −c, a)    by (c2)
and Sum(−c, b, b − c)    by (c1)
and Sum(a, b, a + b)    since Sm(a, b)
So Sum(a + c, b − c, a + b)    by ASSOC
So (a + c) + (b − c) = a + b    by 4.3.6a



d. In general

Suppose ¬Sm(a, b), for otherwise case (c) applies.
Either Sm(−a, −b) or b = −a    by 4.3.6d

Assume b = −a
then a + b = 0
and b − c = (−a) − c
= −a + −c
= −(a + c)
So (a + c) + (b − c)
= (a + c) + −(a + c)
= 0 = a + b

Assume Sm(−a, −b)
then −a + −b = (−a + −c) + (−b − −c)    by case (c)
So −(a + b) = −(a + c) + −(b − c)    by 4.3.5a
So −(a + b) = −((a + c) + (b − c))    by 4.3.5a
So a + b = (a + c) + (b − c)    by 4.3.5d



Theorem 4.3.8. Z_A ⊨ Zgm

Proof. The only axiom for abelian groups that needs further verification is
associativity. We prove this using lemma 4.3.7:

x + (y + z) = (x + y) + ((y + z) − y),    by 4.3.7
= (x + y) + z,    by 4.3.5c

The ordering axioms of Zgm are satisfied in Z_A because Z_A uses the same
ordering as A, A ⊨ PSIZE, and PSIZE includes the same ordering axioms.

Z_A satisfies axiom (10) of Zgm because A satisfies the UNIT axiom of PSIZE.

The divisibility of elements in Z_A required by axiom (11) of Zgm is guaranteed
by the fact that A satisfies the divisibility principles of PCA.

Similarly, Z_A satisfies axioms (12) and (14) of Zgm because A satisfies MIN
and MAX.

Axiom (13) of Zgm is:

y < z ∧ x < x + z → x + y < x + z

If Z_A ⊨ x < x + z
then A ⊨ Sum(x, z, x + z)
Since Z_A ⊨ y < z
then A ⊨ Sum(x, y, x + y)    by EXIST+
and A ⊨ x + y < x + z    by MONOT
So Z_A ⊨ x + y < x + z

□

So, PCA is T-reducible to Zgm. The reduction is uniform since each
model A of PCA has the same domain as its Zgm-model. So, we may conclude
that PCA is model complete.

Theorem 4.3.9. 

a. PCA is model complete. 

b. PCA satisfies Monk's condition. 
Proof. 

a. Apply Theorem 4.2.9. 

b. Immediate from (a) and 4.2.1(b). 

□ 

4.4 Remarks on showing that PCA axiomatizes 
PCS 

To prove that PCA = PCS, we could follow the method outlined at the end of 
chapter 3 for showing that CA = CS. 

We already know that PCS ⊢ PCA, so we need only prove (1), which follows
from (2) by 3.3.1b.

(1) PCA ⊢ PCS

(2) Every completion of PCA is consistent with PCS.
But (2) is equivalent to the conjunction of (2a) and (2b).

(2a) Every finite completion of PCA is consistent with PCS.
(2b) Every infinite completion of PCA is consistent with PCS.

The finite completions of PCA are the theories PCA; EXACTLY_n. But
PCA; EXACTLY_n is true in S_A, where A is the standard finite interpretation
of L_C< containing n atoms. So, the finite completions of PCA are consistent
with PCS. (Formally, we would have to define EXACTLY_n in terms of units
rather than atoms.)

Letting PCAI = PCA + INF and PCSI = PCS + INF, (2b) is a consequence
of (3a) and (3b).

(3a) If PCAI_f is consistent, then PCSI_f is consistent.

(3b) If T is a completion of PCAI, then T = PCAI_f, for some f.

To prove (3a), we would have to prove analogues of 3.3.9 through 3.3.13 for
PCA and PCS. This seems straightforward, but tedious. The trick is to show
that enough axioms about size have been incorporated in PCA to establish the
entailments among MOD statements.

To prove (3b), it is sufficient to demonstrate (4), because every completion
of PCAI entails PCAI_f for some total f.

(4) If f is total, then PCAI_f is complete.

But PCA is model complete, so only (5) remains to be shown.

(5) If f is total and congruous, then PCAI_f has a prime model.

We will not construct prime models for the extensions of PCAI. It's apparent
that the size models of the prime models for CAI_f would do nicely. Alternatively,
the construction could be duplicated in this simpler case.



5 

Completeness of CA 



5.1 Model completeness of CA 

We shall show that CA is model complete by showing that it satisfies Monk's 
criterion (see 4.3.1). So, given Assumption 5.1.1, we want to prove 5.1.2. 

Assumption 5.1.1. A ⊨ CA, B ⊨ CA, A ⊆ B, and C is a finitely generated
substructure of B.

Theorem 5.1.2. There is an isomorphic embedding, f: C → A, where

f(x) = x if x ∈ C ∩ A

The Monk mappings for PCA can serve as a guide in constructing Monk
mappings for CA. The existence of Monk mappings for PCA tells us that we
can find elements with the right sizes. DISJ∪ and REP< then allow us to find
elements with those sizes that fit together in the right way.

Strictly speaking, S_A is not a submodel of S_B, so we cannot apply Monk's
Theorem directly. But, let

S_B(X) = the submodel of S_B whose domain is { σ_B[x] | x ∈ X }

Then, clearly,

S_A ≅ S_B(A) ⊆ S_B, and

S_C ≅ S_B(C), a finitely generated submodel of S_B.

Monk's Theorem applies directly to S_B(A), S_B, and S_B(C), so we may
conclude 5.1.3:

Fact 5.1.3. There is an isomorphic embedding, g: S_C → S_A, where
g(σ_B[x]) = σ_A[x] for all x ∈ C ∩ A.

That is to say, the sizes of elements in C can be embedded in the sizes of
elements in A. It remains to be shown that the elements of C themselves can be
mapped into A by a function which preserves boolean relations as well as size
relations.

C is a finite, and hence atomic, boolean algebra, though its atoms need not
be atoms of B; indeed, if B is infinite, there must be some atoms of C which are
not atoms of B since the union of all atoms of C is the basis of B.

Definition 5.1.4. d is a molecule iff d ∈ C ∩ A and no proper subset of d is
in C ∩ A.

C ∩ A is a boolean algebra whose basis is the same as the basis of C. So
every atom of C is included in some molecule. The embedding f has to map
each molecule to itself. Moreover, f has to be determined by its values on the
atoms of C since f must preserve unions. In fact, the atoms of C can be
partitioned among the molecules. So, if

d = b₁ ∪ ... ∪ b_n

where d is a molecule and b₁, ..., b_n are the atoms of C contained in d, then d
must also be the (disjoint) union of f(b₁), ..., f(b_n). If this condition is satisfied,
f will preserve boolean relations. f must also select images with appropriate
sizes.

Proof of 5.1.2. Given a molecule, d, let b₁, ..., b_n be the atoms of C
contained in d. For each b_i, let c_i be some member of g(σ_B[b_i]). These elements
will be elements of A, since g yields sizes in A whose members are in A. These
elements have the right sizes, as we will show below, but they are in the wrong
places. We have no guarantee that they are contained in the molecule d. So we
still need to show that there are disjoint elements of A, a₁, ..., a_n, whose union
is d and whose sizes are the same as those of b₁, ..., b_n, respectively. Well,

Suppose d = b₁ ∪ ... ∪ b_n
So C ⊨ Sum(b₁, ..., b_n, d)
So S_C ⊨ Sum(σ_B[b₁], ..., σ_B[b_n], σ_B[d])
So S_A ⊨ Sum(g(σ_B[b₁]), ..., g(σ_B[b_n]), g(σ_B[d]))
So A ⊨ Sum(c₁, ..., c_n, d)

since each c_i ∈ g(σ_B[b_i]) and d ∈ g(σ_B[d]) = σ_A[d]. So, the existence of
a₁, ..., a_n, as above, is guaranteed because A ⊨ DEF+.

Now, let f(b_i) = a_i for each b_i in the molecule d. Repeating this procedure
for each molecule yields a value of f for each atom of C. Finally, if x ∈ C is
non-atomic, then

x = b₁ ∪ ... ∪ b_k

where each b_i is atomic. So let

f(x) = f(b₁) ∪ ... ∪ f(b_k)

f satisfies the requirements of Theorem 5.1.2: Boolean relations are preserved
by f because the function is determined by its values on the atoms of C; f
maps elements of C ∩ A into themselves because the set of atoms contained in
each of these molecules is mapped into a disjoint collection of elements of A
whose union is the same molecule; so, we need only show that f preserves size
relations. To do so, we invoke lemma 5.1.6, below:

(a) C ⊨ x < y iff A ⊨ f(x) < f(y)

(b) C ⊨ x ~ y iff A ⊨ f(x) ~ f(y)

(c) C ⊨ Sum(x, y, z) iff A ⊨ Sum(f(x), f(y), f(z))

(d) C ⊨ Unit(x) iff A ⊨ Unit(f(x))

Proof of (a): (The others are similar).

C ⊨ x < y iff S_B ⊨ σ_B[x] < σ_B[y]
iff S_A ⊨ g(σ_B[x]) < g(σ_B[y])
iff S_A ⊨ σ_A[f(x)] < σ_A[f(y)]    by 5.1.6
iff A ⊨ f(x) < f(y)

□

So, we may conclude:
Theorem 5.1.5. CA is model complete.

Lemma 5.1.6. For all x in C,

σ_A[f(x)] = g(σ_B[x])

Proof. Suppose

x = b₁ ∪ ... ∪ b_n

where each b_i is an atom of C.

So B ⊨ Sum(b₁, ..., b_n, x)    since B ⊨ DISJ∪
So S_B ⊨ Sum(σ_B[b₁], ..., σ_B[b_n], σ_B[x])
So S_A ⊨ Sum(g(σ_B[b₁]), ..., g(σ_B[b_n]), g(σ_B[x]))    since S_A ⊆ S_B
But g(σ_B[b]) = σ_A[f(b)]    by the choice of f(b).
So S_A ⊨ Sum(σ_A[f(b₁)], ..., σ_A[f(b_n)], g(σ_B[x]))
But A ⊨ Sum(f(b₁), ..., f(b_n), f(x))    since A ⊨ DISJ∪
So S_A ⊨ Sum(σ_A[f(b₁)], ..., σ_A[f(b_n)], σ_A[f(x)])
So g(σ_B[x]) = σ_A[f(x)]    since S_A ⊨ UNIQ+

□

5.2 Prime models for CA

For each total, congruous remainder function, f, we want to find a prime model,
Q_f, for CAI_f. All of these prime models can be defined over the class, Q, of
sets near quasi-congruence classes (see section 3.2). For different remainder
functions, we need to assign different size relations over Q. Section 5.2.1 defines
the structures Q_f and verifies that each satisfies the respective theory, CAI_f.
Section 5.2.2 defines, for each model of CAI_f, a submodel, or shell. Section
5.2.3 shows that Q_f is isomorphic to the shell of any model of CAI_f.

5.2.1 The standard protomodels

The construction here is a more elaborate version of the construction of Q in
chapter 3 (see 3.2.19). Q turns out to be Q_f, where f(n) = 0 for all n. As in
the case of Q, the models Q_f and their copies in arbitrary models of CAI are
unions of chains.

The sizes assigned to elements of Q to induce Q_f for a total, congruous
remainder function, f, are more elaborate than those used in the definition of
Q (see 3.2.22), but they are employed in substantially the same way:

Definition 5.2.1.

a. A size is an ordered pair (ρ, δ), where both ρ and δ are rational.

b. If θ₁ = (ρ₁, δ₁) and θ₂ = (ρ₂, δ₂) are sizes, then

(i) θ₁ < θ₂ iff ρ₁ < ρ₂ or ρ₁ = ρ₂ and δ₁ < δ₂.

(ii) θ₁ + θ₂ = (ρ₁ + ρ₂, δ₁ + δ₂)

(cf. 3.2.22)

To assign sizes for Q_f we rely on the representation of sets in Q defined in
3.2.24. Q_f, unlike Q, assigns different sizes to the n-congruence classes for a
given n.

Definition 5.2.2. If f is total and congruous, then
a. If x is an n-congruence class, x = [nk + i], then



b. If x ∈ QC_n, so that x is the disjoint union of n-congruence classes,

x₁ ∪ ... ∪ x_k

then

θ_f(x) = θ_f(x₁) + ... + θ_f(x_k)

c. If x is finite, then

θ_f(x) = (0, |x|)

d. If x ∈ Q, so that x can be represented as

(C(x) ∪ Δ₁(x)) − Δ₂(x)

as in 3.2.24, then

θ_f(x) = θ_f(C(x)) + θ_f(Δ₁(x)) − θ_f(Δ₂(x))
= θ_f(C(x)) + (0, |Δ₁(x)|) − (0, |Δ₂(x)|)
= θ_f(C(x)) + (0, |Δ₁(x)| − |Δ₂(x)|)

Intuitively, all n-congruence classes are assigned sizes (1/n, δ), but δ is no
longer 0 in all cases, as in Q. Instead, the first f(n) n-congruence classes are
each one atom larger than the remaining (n − f(n)) n-congruence classes.
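
The arithmetic of sizes can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the representation of a size as a pair of `Fraction`s, the function names, and the sample remainder function `f` are hypothetical, and the normalization used for a single congruence class (that the n classes of a given modulus add up to (1, 0)) is an assumption made for the example, not a formula taken from the text.

```python
from fractions import Fraction

def add(s, t):
    """Componentwise sum of two sizes (rho, delta)."""
    return (s[0] + t[0], s[1] + t[1])

def less(s, t):
    """Lexicographic order: compare rho first, then delta."""
    return s < t  # Python compares tuples lexicographically

def theta_class(f, n, i):
    """Assumed size of the n-congruence class [nk + i], for 0 <= i < n.

    Assumption (not from the text): the n classes together have size (1, 0),
    which forces each common class to have delta = -f(n)/n.
    """
    delta_common = Fraction(-f(n), n)
    bump = 1 if i < f(n) else 0  # the first f(n) classes are one atom larger
    return (Fraction(1, n), delta_common + bump)

def f(n):
    """Hypothetical remainder function, used only for this example."""
    return 5 % n

# Add up the three 3-congruence classes.
total = (Fraction(0), Fraction(0))
for i in range(3):
    total = add(total, theta_class(f, 3, i))
```

Under these assumptions a charmed class exceeds a common class of the same modulus by exactly the size (0, 1) of one atom, and any 3-congruence class is smaller than any 2-congruence class because the first components are compared first.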

The desired models of CAI may now be defined: 

Definition 5.2.3. If f is total and congruous, then the standard protomodel
for f is Q_f, where

the domain of Q_f is Q
Q_f ⊨ x < y iff θ_f(x) < θ_f(y)
Q_f ⊨ x ~ y iff θ_f(x) = θ_f(y)
Q_f ⊨ Unit(x) iff θ_f(x) = (0, 1)
Q_f ⊨ Sum(x, y, z) iff θ_f(z) = θ_f(x) + θ_f(y)

(cf. 3.2.27)

To verify that the structure Q_f is a model of CAI_f, for total and congruous
f, we exhibit each such model as the union of a chain of models.

Definition 5.2.4. If f is total and congruous, then Q_f,n is the submodel of Q_f
whose domain is Q_n.

Fact 5.2.5. If f is total and congruous and n > 0, then Q_f,n ⊨ BASIC.

Proof. The proof can be obtained from the proof of Theorem 3.2.29 by
substituting:

Q_f,n for Q_n,
θ_f(x) for θ(x),
ρ_f(x) for ρ(x), and
δ_f(x) for δ(x)

□

The following notion is helpful in understanding our constructions.

Definition 5.2.6. Suppose A ⊨ BASIC, x ∈ A, and 0 ≤ m ≤ n. Then an
(n, m)-partition of x in A is a sequence x₁, ..., x_n, where

a. x = x₁ ∪ ... ∪ x_n

b. If 0 < i < j ≤ n, then x_i ∩ x_j = ∅, and

c. The x_i are approximately the same size:

(i) If 0 < i < j ≤ m, then x_i ~ x_j,

(ii) If m < i < j ≤ n, then x_i ~ x_j,

(iii) If 0 < i ≤ m < j ≤ n, then Sum(x_j, a, x_i), for any atom a.

In other words, x is partitioned among n pairwise disjoint sets which are roughly
the same size: each of the first m is one atom larger than each of the remaining
n − m. If 0 < i ≤ m, x_i is called a charmed n-factor of x; for i > m, x_i is a
common n-factor of x.

The sequence Q_f,n does not constitute a chain of models. For example, Q_f,3
is not an extension of Q_f,2. But this sequence does harbor a chain of models:

Fact 5.2.7. If n < m, then Q_f,n! ⊆ Q_f,m!.

Proof. It will be clearer, and easier, to establish this by example rather than by
formal proof. Letting n = 2 and m = 3, we want to show that the 2-congruence
classes have the same size relations in Q_f,6 as they do in Q_f,2, for any f. The
other elements of Q will then fall into place, since size relations are determined
by the representations of each set, x, as C(x), Δ₁(x), and Δ₂(x).
Suppose that f(2) = 0, so that

Q_f,2 ⊨ [2n] ~ [2n + 1]

Since f is congruous, f(6) ∈ {0, 2, 4}. If f(6) = 0, then all of the 6-congruence
classes are common. If f(6) = 2, then [6n] and [6n + 1] are the only charmed
6-congruence classes. If f(6) = 4, then all of the 6-congruence classes are charmed
except for [6n + 4] and [6n + 5].

In any case, [2n] will include the same number of charmed 6-congruence
classes as [2n + 1], so

Q_f,6 ⊨ [2n] ~ [2n + 1]

Suppose, however, that f(2) = 1, so that

Q_f,2 ⊨ [2n] is one atom larger than [2n + 1]

Here, f(6) ∈ {1, 3, 5}, since f is congruous. In any case, [2n] contains exactly
one more charmed 6-congruence class than [2n + 1], so

Q_f,6 ⊨ [2n] is one atom larger than [2n + 1]

So it goes in general. □
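
The counting behind this example can be checked mechanically. The Python sketch below is illustrative only: it assumes that "congruous" implies 0 ≤ f(n) < n and f(nm) ≡ f(n) (mod n), which is how the property is used in this chapter, and the function names and sample remainder function are hypothetical.

```python
def charmed_subclasses(f, n, N, i):
    """Count the charmed N-congruence classes [Nk + r], i.e. those with
    r < f(N), that lie inside the n-congruence class [nk + i].
    Requires that n divides N and 0 <= i < n."""
    assert N % n == 0 and 0 <= i < n
    return sum(1 for r in range(f(N)) if r % n == i)

def agrees(f, n, N):
    """True if every charmed n-class contains exactly one more charmed
    N-class than every common n-class does."""
    k, rem = divmod(f(N) - f(n), n)
    if rem != 0:
        return False  # f is not congruous for this pair
    return all(
        charmed_subclasses(f, n, N, i) == k + (1 if i < f(n) else 0)
        for i in range(n)
    )

def f(n):
    """Hypothetical remainder function with f(2) = 1 and f(6) = 5."""
    return 5 % n
```

With this `f`, the class [2n] contains the charmed classes [6n], [6n + 2], and [6n + 4], while [2n + 1] contains only [6n + 1] and [6n + 3], matching the case f(2) = 1 discussed in the proof.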
Fact 5.2.7 allows us to regard Q_f as the union of a chain of models:
Fact 5.2.8. If f is total and congruous, then

Q_f = ⋃_{n>0} Q_f,n!

Fact 5.2.9. If f is total and congruous, then

a. Q_f ⊨ BASIC

b. Q_f ⊨ ADIV_k, for k > 0

c. Q_f ⊨ MOD_n,f(n), for n > 0

d. Q_f ⊨ CAI_f
Proof.

a. BASIC is a universal-existential theory; so it is preserved under unions of
chains. (See Appendix, Fact B.5).

b. Each n-congruence class can be partitioned into k nk-congruence classes.

c. Q_f,n ⊨ MOD_n,f(n), by definition. Since MOD_n,f(n) is an existential
sentence, it is preserved under extensions.

d. Immediate from (a), (b), and (c).

□ 

5.2.2 Shells of models 

To embed the model Q_f into an arbitrary model, A, of CAI_f, we must find a
smallest submodel, B, of A which satisfies CAI. Clearly, the basis of A, call it
x₀, must be included in B, since the symbol I must refer to the same set in the
submodel as it does in the model. But, if the basis of A is in B and B ⊨ CAI,
then B must contain two disjoint sets of roughly the same size whose union is
the basis of A, x₀. Pick such a pair, x₁ and x₂, to include in B. Whether these
are exactly the same size or differ by an atom depends on whether A ⊨ MOD_2,0
or A ⊨ MOD_2,1.

B must also satisfy ADIV_3. We can aim for this by placing in B three disjoint
sets, x₁₁, x₁₂, and x₁₃, whose union is x₁ and another three disjoint sets, x₂₁,
x₂₂, and x₂₃, whose union is x₂. The existence of such sets is assured because
A ⊨ ADIV_3. Again, the exact size relations will be determined by which MOD
principles are satisfied in A. Ensuring that x₁ and x₂ are each divisible by 3
also guarantees that x₀ is divisible by 3: the three unions

x₁₁ ∪ x₂₁, x₁₂ ∪ x₂₂, and x₁₃ ∪ x₂₃

will be roughly the same size and will exhaust x₀.

We can continue this process indefinitely, dividing each set introduced at
stage n into n + 1 roughly equal subsets at stage n + 1. This will produce an
infinite tree, bearing sets. The deeper a node is in this tree, the smaller the set
it bears and the greater the number of successors among which this set will be
partitioned.

This great tree of sets will not form a boolean algebra, nor will it be closed
under finite unions. A boolean algebra could be obtained by including both the
node sets and their finite unions, but this would still not be an atomic boolean
algebra, which is what we are looking for. We cannot correct for this problem
by including in B all atoms of A: A may have uncountably many atoms while B,
to be a prime model, must be countable. We leave the solution of this problem
to the formal construction.

The formal proof proceeds as follows: First, we define a tree, i.e. the set of
nodes on which we shall hang both the components of the successive partitions
described above and the atoms of the submodel being constructed. Second, we
present the construction which, given a model A of CAI_f, assigns a node set
A_λ and a node atom, a_λ, to each node, λ. Third, we define the shell of A as
the submodel of A generated by the collection of node sets and node atoms. In
the next section, we show that the shell of A is isomorphic to Q_f.

First, the tree: 

Definition 5.2.10.

a. A node is a finite sequence ⟨n₁, ..., n_k⟩, where k > 0 and for all i ≤ k,
n_i < i. (The variables λ and κ range over nodes.)

b. If λ = ⟨n₁, ..., n_k⟩, then

(i) The length, or depth, of λ, L(λ), is k.

(ii) If 1 ≤ i ≤ k, then λ(i) = n_i.
(iii) λ · m = ⟨n₁, ..., n_k, m⟩

c. λ extends κ iff L(λ) > L(κ) and, for 1 ≤ i ≤ L(κ), λ(i) = κ(i).

d. λ 0-extends κ iff λ extends κ and, for L(κ) < i ≤ L(λ), λ(i) = 0.

Nodes constitute the vertices of an infinite tree in which ⟨0⟩ is the root and
λ dominates κ iff κ extends λ. The number of immediate descendants of a node
grows as the depth of the node increases.
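
As a concrete illustration, the tree of nodes can be enumerated directly. This Python sketch represents a node as a tuple whose entry at 1-based position i is less than i; the tuple representation and the function names are choices made for the example, not notation from the text.

```python
from itertools import product

def nodes(depth):
    """All nodes <n_1, ..., n_depth> with n_i < i (positions are 1-based)."""
    return [tuple(p) for p in product(*(range(i) for i in range(1, depth + 1)))]

def extends(lam, kap):
    """lam extends kap: lam is strictly longer and agrees with kap on kap's length."""
    return len(lam) > len(kap) and lam[:len(kap)] == kap

def zero_extends(lam, kap):
    """lam 0-extends kap: lam extends kap and every added entry is 0."""
    return extends(lam, kap) and all(v == 0 for v in lam[len(kap):])
```

For example, `nodes(1)` contains only the root `(0,)`, and a node of depth k has k + 1 immediate descendants, so the number of nodes at each depth grows factorially.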

We shall now assign a set to each node by repeatedly partitioning the basis 
of A. At the same time we shall assign an atom to each node. 

Definition 5.2.11. Given A ⊨ CAI_f, where f is a total, congruous
remainder function:

a. Let A_⟨0⟩ be the basis of A, and let a_⟨0⟩ be any atom of A.

b. Suppose that A_λ and a_λ have been chosen. Let m = L(λ) and let

k = (f((m + 1)!) − f(m!)) / m!

Since f is congruous, k is an integer (see Definition 3.3.7e). Let

A_λ·0, ..., A_λ·m

be an (m + 1, k)-partition of A_λ if A_λ is common or an (m + 1, k + 1)-partition
of A_λ if A_λ is charmed. Fact 5.2.12 guarantees that such
partitions exist. In either case, choose A_λ·0 so that it contains a_λ. This
is always possible because A_λ contains a_λ.

c. Let a_λ·0 = a_λ. If 0 < i ≤ m, let a_λ·i be any atomic subset of A_λ·i.

The sets A_λ will be referred to as node sets and the atoms a_λ as node atoms.

Fact 5.2.12. Suppose A ⊨ CAI_f, n > 0, m > 0, and

k = (f(nm) − f(n)) / n

Then

a. x has an (m, k)-partition iff A ⊨ Mod_m,k(x)

b. Every common n-factor of A has an (m, k)-partition.

c. Every charmed n-factor of A has an (m, k + 1)-partition.
Proof.

a. Mod_m,k(x) says that x has an (m, k)-partition.

b. All common n-factors are the same size and satisfy the same Mod_m,k
predicate. So, suppose that each common n-factor has k charmed m-factors
and m − k common m-factors.

By (a), A has f(n) charmed n-factors and n − f(n) common n-factors.
Partitioning each of the n-factors into m subsets of roughly the same size
yields an nm-partition of A; the charmed m-factors of the n-factors are
the charmed nm-factors of A and the common m-factors of the n-factors
are the common nm-factors of A.

Each of the common n-factors has k charmed m-factors and each of the
charmed n-factors has k + 1 charmed m-factors. In all, there are

(n − f(n))k + f(n)(k + 1)
= nk + f(n)

charmed m-factors among the n-factors of A.

But, by (a) again, there are f(nm) charmed nm-factors of A. So

f(nm) = nk + f(n)
and k = (f(nm) − f(n))/n

c. Immediate from (b), since every charmed n-factor is one atom larger than
each common n-factor.

□ 
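
The counting identity in part (b) can be spot-checked numerically. The sketch below is illustrative only; it assumes that a congruous remainder function satisfies 0 ≤ f(n) < n and f(nm) ≡ f(n) (mod n), and both the helper name and the sample function are hypothetical.

```python
def charmed_nm_factors(f, n, m):
    """Count the charmed m-factors obtained by splitting each n-factor:
    common n-factors contribute k each, charmed n-factors k + 1 each,
    where k = (f(nm) - f(n)) / n."""
    k, rem = divmod(f(n * m) - f(n), n)
    if rem != 0:
        raise ValueError("f is not congruous for this pair (n, m)")
    common, charmed = n - f(n), f(n)
    return common * k + charmed * (k + 1)

def f(n):
    """Hypothetical remainder function, used only for this example."""
    return 7 % n
```

For any function of this form, `charmed_nm_factors(f, n, m)` equals `f(n * m)`, which is the equation f(nm) = nk + f(n) derived in the proof.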

The facts listed in 5.2.13 should clarify this construction. All of them can be
established by induction on the depth of nodes.

Fact 5.2.13.

a. A_λ ⊆ A_κ iff λ extends κ or κ = λ.

b. If A_λ = A_κ, then κ = λ.

c. There are n! nodes (and, hence, node sets) of depth n.

d. If i ≠ j, then A_λ·i ∩ A_λ·j = ∅.

e. Any two node sets of the same depth are disjoint.

f. Each node set is the disjoint union of its immediate descendants.

g. Each node set is the disjoint union of all of its descendants at any given
depth.

h. a_λ ⊆ A_κ iff λ extends κ.

i. a_λ = a_κ iff one of λ and κ 0-extends the other.

j. Every node set contains infinitely many node atoms.

k. For any n, the node sets of depth n form an (n!, f(n!))-partition of the
basis of A.

Proof. We demonstrate (k) by induction on n: If n = 1, then n! = 1, f(n!) = 0,
and A_⟨0⟩ is the basis of A. So (k) holds because any set is a (1, 0)-partition of
itself.

Assume that (k) holds for n. Then (k) also holds for n + 1: each node set of
depth n has n + 1 immediate descendants; so, there are (n!)(n + 1) = (n + 1)!
node sets of depth n + 1.

Furthermore, f(n!) of the node sets of depth n are charmed and n! − f(n!) are
common. By Fact 5.2.12, each charmed node set of depth n has an
(n + 1, k + 1)-partition and each common node set of depth n has an
(n + 1, k)-partition, where

k = (f(n!(n + 1)) − f(n!)) / n!
= (f((n + 1)!) − f(n!)) / n!

(substituting n! for n and n + 1 for m).
So, there are

(k + 1)f(n!) charmed node sets of depth n + 1 from the charmed node sets of depth n
and k(n! − f(n!)) charmed node sets of depth n + 1 from the common node sets of depth n

In all, then, the number of charmed node sets of depth n + 1 is:

(k + 1)f(n!) + k(n! − f(n!))
= kf(n!) + f(n!) + n!k − f(n!)k
= f(n!) + n!k
= f(n!) + f((n!)(n + 1)) − f(n!)
= f((n!)(n + 1))
= f((n + 1)!)

So, the node sets of depth n + 1 form an ((n + 1)!, f((n + 1)!))-partition
of the basis. That is to say, (k) holds for n + 1. □
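
The induction can be mirrored by a small simulation that tracks only how many node sets of each depth are charmed. This Python sketch is illustrative; it assumes that congruous functions satisfy 0 ≤ f(n) < n and f(nm) ≡ f(n) (mod n), and the function names and sample remainder function are hypothetical.

```python
from math import factorial

def charmed_counts(f, max_depth):
    """Return a list of (depth, number of node sets, number of charmed node sets),
    following the partitioning step used in the induction."""
    total, charmed = 1, 0  # depth 1: the basis alone, a (1, 0)-partition of itself
    rows = [(1, total, charmed)]
    for m in range(1, max_depth):
        k, rem = divmod(f(factorial(m + 1)) - f(factorial(m)), factorial(m))
        assert rem == 0, "f is not congruous at this step"
        common = total - charmed
        # Each common node set yields k charmed children, each charmed one k + 1.
        charmed = common * k + charmed * (k + 1)
        total *= m + 1
        rows.append((m + 1, total, charmed))
    return rows

def f(n):
    """Hypothetical remainder function, used only for this example."""
    return 5 % n
```

With this `f`, the rows for depths 1 through 4 are (1, 1, 0), (2, 2, 1), (3, 6, 5), and (4, 24, 5), so at each depth n there are n! node sets of which f(n!) are charmed.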

Given a collection of node sets, A_λ, and node-atoms, a_λ, from a model, A,
of CAI_f, we can now construct a submodel, B, of A which is isomorphic to Q_f.

Definition 5.2.14. Suppose f is total and congruous, that A ⊨ CAI_f and A_λ
and a_λ are the node sets and node-atoms of A produced by construction 5.2.11.
Then, the shell of A is the submodel of A generated by {A_λ, a_λ}.

For the remainder of this chapter, we will regard as fixed:

a. f, a total, congruous remainder function,

b. A, a model of CAI_f,

c. {A_λ, a_λ}, a set of node sets and node-atoms produced by the construction
above,

d. the shell of A generated from {A_λ, a_λ}.

To show that the shell of A is isomorphic to Q_f, we need a sharper
characterization of its elements. Recall from 3.2.24 that each member, x, of Q
has a unique representation as

(C(x) ∪ Δ₁(x)) − Δ₂(x)

where C(x) is a quasi-congruence class, Δ₁(x) and Δ₂(x) are finite sets and

Δ₁(x) ∩ C(x) = ∅

Δ₁(x) ∩ Δ₂(x) = ∅
Δ₂(x) ⊆ C(x)
We can obtain a similar representation for elements of A: the node sets play
the role of (some of) the congruence classes; finite unions of node sets correspond
to the quasi-congruence classes; finite sets of node-atoms correspond to the finite
subsets in Q.

Definition 5.2.15.

a. x is a quasi-nodal set of A iff it is the union of finitely many node sets
(iff it is the union of finitely many node sets at a given depth).

b. x is an A-finite set iff it is a finite set of node atoms.

c. If x ∈ A and y ∈ A, then x is A-near y iff both x − y and y − x are
A-finite sets.

(cf. 3.2.14-3.2.18)

Still following in the footsteps of chapter 3, we can characterize A as the
collection of sets A-near quasi-nodal sets. Analogues of 3.2.15 through 3.2.18
obtain for A-nearness.

Fact 5.2.16.

a. x ∈ A iff x is A-near some quasi-nodal set of A.

b. A is an atomic boolean algebra whose atoms are the node-atoms a_λ.

c. If x ∈ A, then x has a unique representation as

(C(x) ∪ Δ₁(x)) − Δ₂(x)

where C(x) is a quasi-nodal set disjoint from Δ₁(x) and including Δ₂(x),
both of which are A-finite sets.

Proof.

a. (⇒) A is generated from node sets and node atoms via the boolean
operations, each of which preserves A-nearness to quasi-nodal sets.

(⇐) A must contain finite unions of node sets as well as A-finite sets; so it
must also contain sets obtained by adding or removing A-finite sets from
quasi-nodal sets.

b. The proof parallels that of Theorem 3.2.21 exactly.

c. Let C(x) be the quasi-nodal set which is A-near x (see 3.2.23); let
Δ₁(x) = x − C(x); and let Δ₂(x) = C(x) − x.

□ 
5.2.3 The embeddings

To embed Q_f into A, we first describe Q in terms of node sets and node atoms.
In effect, we are performing the construction 5.2.11 on Q_f, but with two
differences: first, we are stipulating which (n, m)-partitions to use at each level;
second, we are selecting node atoms so that every singleton in Q is the node
atom for some node. This latter condition guarantees that the shell of Q_f will
be Q_f itself.

Definition 5.2.17. Suppose λ is a node. Then

a. If L(λ) = k, then

Q_λ = [k!n + m]

where

m = Σ_{i=1}^{k} λ(i)(i − 1)!

b. The depth of Q_λ is L(λ).
Examples.

Q_⟨0⟩ = [n]

Q_⟨0,0⟩ = [2n]

Q_⟨0,1⟩ = [2n + 1]

Q_⟨0,1,0⟩ = [6n + 1]

Q_⟨0,0,2⟩ = [6n + 4]

Q_⟨0,1,2⟩ = [6n + 5]


Definition 5.2.18.

a. ι(λ) = the least n ∈ Q_λ.

b. q_λ = {ι(λ)}

Fact 5.2.19.

a. If λ = ⟨n₁, ..., n_k⟩, then

ι(λ) = Σ_{i=1}^{k} n_i (i − 1)!

b. ι(λ) = ι(κ) iff λ = κ or one 0-extends the other.

c. For every n, there's a λ such that n = ι(λ).

d. For every n, there are infinitely many λ such that n = ι(λ).

e. For every n, there's a λ such that n = ι(κ) iff κ = λ or κ 0-extends λ.

Fact 5.2.20.

a. At each depth, n, the ι(λ) take on all and only values less than n!.

b. If ι(λ) = k, then all nodes along the left-most branch descending from λ
also have the value k. These are the only nodes below λ with the value k.

c. Every natural number is the value of all and only those nodes along the
left-most branch descending from some node.
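
These properties of node values are easy to check by brute force. In the Python sketch below, the value of a node is computed as the sum of λ(i)(i − 1)! from Definition 5.2.17, which is the least element of the corresponding congruence class; the tuple representation of nodes and the function names `nodes` and `iota` are choices made for this illustration.

```python
from itertools import product
from math import factorial

def nodes(depth):
    """All nodes <n_1, ..., n_depth> with n_i < i (positions are 1-based)."""
    return [tuple(p) for p in product(*(range(i) for i in range(1, depth + 1)))]

def iota(lam):
    """Value of a node: the sum of lam(i) * (i - 1)! over 1-based positions i."""
    # enumerate is 0-based, so the entry at index i stands at position i + 1
    # and is weighted by i!.
    return sum(v * factorial(i) for i, v in enumerate(lam))

# At depth 4 the values are exactly 0, 1, ..., 4! - 1, each taken once.
values_at_depth_4 = sorted(iota(lam) for lam in nodes(4))
```

For instance, `iota((0, 1, 2))` is 5, matching the example class [6n + 5], and appending zeros to a node leaves its value unchanged, which is why the nodes sharing a value form a left-most branch.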

Though for a given natural number n, there will be infinitely many nodes λ
for which ι(λ) = n, we can associate with each natural number a shortest (i.e.
shallowest) node for which ι(λ) = n.

Definition 5.2.21. λ_n = the shortest λ such that ι(λ) = n.

Fact 5.2.22.

a. ι(λ_n) = n

b. λ extends λ_ι(λ)

c- \(\ n ) = Ki 

d. λ_ι(λ) = λ iff λ = ⟨0⟩ or λ(L(λ)) ≠ 0 (i.e., a node, λ, will be the highest
node with a certain value just in case λ is not the leftmost immediate
descendant of its parent.)

Each of the points listed in Fact 5.2.13 holds for the sets Q_λ and q_λ. That is
to say, the Q_λ can be regarded as node sets and the q_λ as node atoms for any
model Q_f. Notice, especially, that 5.2.13k holds.

We may, finally, define the embedding of Q_f into A:

Fact 5.2.23. If x ∈ Q, there is a unique y ∈ A such that

∀λ(q_λ ⊆ x ↔ a_λ ⊆ y)

Proof. There is at most one such y, by Fact 5.2.16. To show that there is such
a y, suppose first that x ∈ QC.
Then there is some n such that

x = x₁ ∪ ... ∪ x_k

where each x_i is an (n!)-congruence class and hence a node set in Q_f. So

x = Q_{λ₁} ∪ ... ∪ Q_{λ_k}

Now, let

y = A_{λ₁} ∪ ... ∪ A_{λ_k}

So y ∈ A and:

q_λ ⊆ x iff q_λ ⊆ Q_{λ_i}, for some i
iff λ extends λ_i    by 5.2.13h
iff a_λ ⊆ A_{λ_i}
iff a_λ ⊆ y.

If x is not a quasi-congruence class, then

x = (x' ∪ Δ₁(x)) − Δ₂(x)

where x' is a quasi-congruence class. Let y' be the element of A corresponding
to x', as described above, and let

y = (y' ∪ {a_λ | q_λ ⊆ Δ₁(x)}) − {a_λ | q_λ ⊆ Δ₂(x)}

□

Definition 5.2.24. The nodal embedding of Q into A is defined by letting
g(x), for x ∈ Q, be the y ∈ A such that

∀λ(a_λ ⊆ y ↔ q_λ ⊆ x)

Fact 5.2.25.

a. If x ∈ Q, then

g(x) = (g(C(x)) ∪ g(Δ₁(x))) − g(Δ₂(x))

b. g is one-one.

c. g maps Q onto A.
Proof.

a. Immediate from the proof of 5.2.23.

b. Suppose g(x) = y = g(x'). Then

∀λ(q_λ ⊆ x ↔ a_λ ⊆ y ↔ q_λ ⊆ x')
But every integer is ι(λ) for some λ, so x = x'.

c. Obvious.

□



Theorem 5.2.26. The nodal embedding, g, of Q into A is an isomorphism of 
Qf onto A. 



Proof. 



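a.

$Q_f \models x \subset y$ iff $\forall\lambda\,(q_\lambda \subset x \rightarrow q_\lambda \subset y)$

iff $\forall\lambda\,(a_\lambda \subset g(x) \rightarrow a_\lambda \subset g(y))$ (by 5.2.24)

iff $A \models g(x) \subset g(y)$ (by 5.2.16)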

b. As in (a), g can be shown to preserve 0, I, unions, intersections, relative 
complements, and proper subsets. 

c. Let nc(z,n) be the number of charmed node sets of level n contained in 
C(z). And let n, below, be the least k such that x and y are both unions 
of node sets of depth k. 

Q f ^x~yiS e f (x)=6 f (y) 

iff ncforO-GAitaOI-IAate)!) 

= nc(y,n)-(\A 1 (y)\-\A 2 (y)\) 
iffnc(. 9 (x),n)-(| 5 (A 1 (x))|-| 5 (A 2 (x))|) 
= nc(g(y),n)-(\g(A 1 (y))\-\g(A 2 (y))\) 
since Q\ is charmed iff A A is charmed 
and g preserves boolean relations 
\SA\=g(x) ~g(y) 

iff A 1= g(x) ~ g(y) since .A C A 



Qf \= x < y i& Qf \= x ~ x' C y 

for some x' G Q since Q/ N REP < 

iff .4 N .g(x) - «?(x') C 5 (y) by (b) and (c) 

iff A \= g(x) < g(y) since A N SUBSET, INDISCT 

iff A \= g(x) < g(y) since AC A 
e.



Qf \= Sum(x, y, z 



Jiff 



iff 



iff 



iff 



iff 



Q f \= (x'Uy' = z 
A x ~ a;' A y ~ y' 

Ai'n ?/ = 0) 

iN( fl (a;')U fl (y , )= 5 (s) 
A#) ~ g(x') 

A ff (a;')n fl (y , ) = 0) 
iN(^)U 9 (!/') = 3(z) 
Ag(x)~g(x') 

A 5 (x')n 5 (y') = 0) 
.4 N Sum(5(a;),5(y),g(^)) 
.4 t= Sum(g(a;), 5 (?;), 5 (z)) 



since Q/ h DEF 



since .4 1= DEF+ 



by (b) and (c) 



since ^4 C A 



□ 



So, each model of $\mathrm{CAI}_f$ has a submodel isomorphic to $Q_f$, and we may
conclude:

Corollary 5.2.27. If f is total and congruous, then $\mathrm{CAI}_f$ has a prime model.

5.3 Summary 

We can now draw our final conclusions about CA, CS, and their completions. 

Theorem 5.3.1. If f is a total, congruous remainder function, then $\mathrm{CAI}_f$ is
consistent and complete.

Proof. $\mathrm{CAI}_f$ is consistent because f is congruous, by 3.3.12c. Since $\mathrm{CAI}_f$ is
model complete, by 5.1.6, and has a prime model, by 5.2.27, the prime model
test (Appendix, Fact C.3) applies. So $\mathrm{CAI}_f$ is complete. □

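Corollary 5.3.2. If T is a completion of CAI, then $T = \mathrm{CAI}_f$, for some
congruous f.

Proof. For each n > 0, $T \vdash \mathrm{MOD}_{n,i}$ for exactly one i, $0 \le i < n$:

Since T is complete, $T \vdash \mathrm{MOD}_{n,i}$ or $T \vdash \neg\mathrm{MOD}_{n,i}$ for each such i. But if
$T \vdash (\neg\mathrm{MOD}_{n,0} \land \dots \land \neg\mathrm{MOD}_{n,n-1})$, then T is inconsistent, since $T \vdash \mathrm{CAI}$ and
$\mathrm{CAI} \vdash \mathrm{DIV}_n$. Hence $T \vdash \mathrm{MOD}_{n,i}$ for at least one i, $0 \le i < n$.

But suppose $T \vdash \mathrm{MOD}_{n,i}$ and $T \vdash \mathrm{MOD}_{n,j}$ where $0 \le i \ne j < n$. Again, T
would be inconsistent, for $\mathrm{CA} \vdash \mathrm{MOD}_{n,i} \rightarrow \neg\mathrm{MOD}_{n,j}$ (see Lemma 3.3.11).

So, let f(n) = m iff $T \vdash \mathrm{MOD}_{n,m}$. Then $T \vdash \mathrm{CAI}_f$ and, since $\mathrm{CAI}_f$ is
complete, $\mathrm{CAI}_f \vdash T$.

□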
Corollary 5.3.3.

a. Every completion of CA is consistent with CS.

b. CA = CS
Proof.

a. Follows from 5.3.2 and 3.3.13.

b. Follows from (a) and 3.3.2b, given that $\mathrm{CS} \vdash \mathrm{CA}$.

□

Theorem 5.3.4.

a. For n > 0, $\mathrm{CS}; \mathrm{EXACTLY}_n$ is decidable.

b. CS is decidable.
Proof.

a. $\mathrm{CS}; \mathrm{EXACTLY}_n \vdash \phi$ iff $\mathcal{A}_n \models \phi$. But $\mathcal{A}_n$ is a finite model.

b. To determine whether $\mathrm{CS} \vdash \phi$, alternate between generating theorems of
CA and testing whether $\mathcal{A}_n \models \neg\phi$.

□

Corollary 5.3.5.

a. CSI has $2^{\omega}$ completions.

b. For total f, $\mathrm{CSI}_f$ is decidable iff f is decidable.
Proof.

a. There are $2^{\omega}$ remainder functions whose domain is the set of prime
numbers. Each such function is congruous, so each corresponds to a consistent
extension of CSI. By Lindenbaum's lemma, each of these extensions has
a consistent and complete extension.

b. ($\Leftarrow$) If f is decidable, then $\mathrm{CAI}_f$ is recursively enumerable. But $\mathrm{CAI}_f$ is
complete, so it is decidable.

($\Rightarrow$) To calculate f(n), determine which $\mathrm{MOD}_{n,m}$ sentence is in $\mathrm{CSI}_f$.

□

Theorem 5.3.6. There is no sentence, $\phi$, such that $T = \mathrm{CA}; \phi$ is consistent
and T has only infinite models.

Proof. If $\phi$ is true only in infinite models of CA, then $\neg\phi$ is true in all finite
models of CA, so $\neg\phi \in \mathrm{CS}$. But CA = CS, so $\mathrm{CA}; \phi$ is inconsistent. □

Theorem 5.3.7. CS is not finitely axiomatizable.

Proof. Suppose $\mathrm{CS} \vdash \phi$. So $\mathrm{CA} \vdash \phi$ and, by compactness, $(\mathrm{BASIC} \cup T) \vdash \phi$
for some finite set of $\mathrm{ADIV}_n$ principles, $T = \{\mathrm{ADIV}_n \mid n \in J\}$. Let
$K = \{n \mid \text{every prime factor of } n \text{ is a member of } J\}$.

Let $\mathcal{A}$ be a model with domain $\bigcup_{k \in K} Q_k$, in which size relations are
determined in accordance with the size function, $\theta$, defined in 3.2.26. We claim the
following without proof:

(1) $\mathcal{A} \models \mathrm{BASIC}$
(2) $\mathcal{A} \models \mathrm{ADIV}_j$ for all $j \in J$
(3) $\mathcal{A} \not\models \mathrm{ADIV}_k$

By (1) and (2), $\mathcal{A} \models (\mathrm{BASIC} \cup T)$, so $\mathcal{A} \models \phi$. But by (3), $\mathcal{A} \not\models \mathrm{CS}$. Hence
$\phi \not\vdash \mathrm{CS}$. □
6

Sets of Natural Numbers



CS has standard finite models since it consists of sentences true in all such
models. It has infinite models $Q_n$ and $Q_f$. In this chapter we will show that
CS has infinite standard models over $\mathcal{P}(N)$.

An ordering of $\mathcal{P}(N)$ that satisfies CS will not necessarily appear reasonable.
For example, some such orderings say that there are fewer even numbers than
prime numbers (see 6.2.13 below). To rule out such anomalies, we introduce
a principle, OUTPACING, in section 1. OUTPACING mentions the natural
ordering of N and applies only to subsets of N. Section 2 establishes that
OUTPACING can be satisfied jointly with any consistent extension of CS in
a model whose domain is $\mathcal{P}(N)$. So CS; OUTPACING does not fix the size
relations over $\mathcal{P}(N)$.

6.1 The outpacing principle 

Throughout this chapter, x and y will range over $\mathcal{P}(N)$.

Definition 6.1.1. x outpaces y just in case the restriction of x to any
sufficiently large initial segment of N is larger than the corresponding restriction of
y, that is, iff:

$\exists n\,\forall m\,(m > n \rightarrow |x^{[m]}| > |y^{[m]}|)$

Notice that the size comparison between the two restricted sets will always 
agree with the comparison of their normal cardinalities since all initial segments 
of N are finite. 
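
The definition can be explored on finite horizons. The following Python sketch is an illustration only: the name `last_failure` and the membership-test encoding of sets are assumptions of this example, and $x^{[m]}$ is taken to be $x \cap \{0, \dots, m\}$. It reports the last m up to a horizon at which $|x^{[m]}| > |y^{[m]}|$ fails. A small value relative to the horizon is evidence for outpacing, not a proof, since the definition quantifies over all m.

```python
def last_failure(x, y, horizon):
    """Largest m <= horizon with |x^[m]| <= |y^[m]|, or -1 if there is none.

    x and y are membership tests on the natural numbers.
    """
    last, x_count, y_count = -1, 0, 0
    for m in range(horizon + 1):
        x_count += 1 if x(m) else 0
        y_count += 1 if y(m) else 0
        if x_count <= y_count:
            last = m
    return last


evens = lambda k: k % 2 == 0
odds = lambda k: k % 2 == 1
threes = lambda k: k % 3 == 0

print(last_failure(evens, threes, 1000))  # failures stop early
print(last_failure(evens, odds, 1000))    # failures recur up to the horizon
```

For the evens against the multiples of three the failures stop almost immediately, while for the evens against the odds they recur at every odd m, which matches the observation that neither of those two sets outpaces the other.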

We employ this notion to state a sufficient condition for one set of natural 
numbers to be larger than another: 

OUTPACING. If x outpaces y, then x > y.

The general motivation behind this principle should be familiar. We extrapolate
from well understood finite cases to puzzling infinite cases. But we should also
emphasize, again, that this extrapolation cannot be done in any straightforward,
mechanical way without risking contradiction. We cannot, for example,
strengthen the conditional to a biconditional, thus:

(1) x > y iff x outpaces y.

This revised principle conflicts with CS, for outpacing is not a quasi-linear
ordering. For example, neither [2n] nor [2n + 1] outpaces the other since each
initial segment {0, . . . , 2n + 1} of N contains n + 1 evens and n + 1 odds. But the two
are discernible under outpacing, since [2n] outpaces [2n + 2] while [2n + 1] does
not.

There is another point that underlines the need for care in extrapolating 
from finite cases to infinite cases: we cannot just use (2): 

(2) If, given any finite subset z of N, x restricted to z is larger 
than y restricted to z, then x > y. 

Though (2) is true, its antecedent is satisfied only when $y \subset x$.

So, there are many statements that assert of infinite cases what is true of 
finite cases. Some of these conflict with one another. Others are too weak to 
be helpful. It is doubtful whether there is any mechanical way to decide which 
of these statements are true. The best we can do is propose plausible theories, 
determine whether they are consistent, and see how far they go. 

Definition 6.1.0. $\mathcal{A}$ is an outpacing model iff

the domain of $\mathcal{A}$ is $\mathcal{P}(N)$,

$\mathcal{A} \models \mathrm{BASIC}$, and

$\mathcal{A} \models \mathrm{OUTPACING}$

There is a slight difficulty in saying that an interpretation of $L_{\subset <}$ satisfies
OUTPACING. Since OUTPACING involves the < relation over N, it cannot be
expressed in $L_{\subset <}$. We shall finesse this problem by regarding OUTPACING as
the (very large) set of sentences

$\{\, b < a \mid a \text{ outpaces } b \,\}$

Fact 6.1.1. Every outpacing model satisfies the following:

a. $[2n] > [3n]$

b. $[3n] > [4n] > [5n] > \dots$

c. If k > 0, then $[kn] > [n^2]$
Proof.

a. [2n] has at least $(k - 1)/2$ members less than or equal to k for any given k.
[3n] has at most $k/3 + 1$ such members. If k > 9, then $(k - 1)/2 > k/3 + 1$.

b. Similar to (a).

c. If $m = k^2$, both [kn] and [$n^2$] have exactly k members less than m. For
$m > k(k + 1)$, $N^{[m]}$ will have more members in [kn] than in [$n^2$].

□ 
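
The counting in part (c) can be checked mechanically for small k. The sketch below is a hedged illustration: the helper names are hypothetical, [kn] is read as {0, k, 2k, ...} and [$n^2$] as {0, 1, 4, ...}, and "up to m" is read inclusively.

```python
from math import isqrt


def multiples_below(k, m):
    """Number of members of [kn] = {0, k, 2k, ...} that are less than m."""
    return (m - 1) // k + 1 if m > 0 else 0


def squares_below(m):
    """Number of members of [n^2] = {0, 1, 4, ...} that are less than m."""
    return isqrt(m - 1) + 1 if m > 0 else 0


def check(k, bound=2000):
    """Check both counting claims of part (c) for one k, up to a finite bound."""
    tie_at_square = multiples_below(k, k * k) == k == squares_below(k * k)
    # Counts up to and including m are the "below m + 1" counts.
    lead_afterwards = all(
        multiples_below(k, m + 1) > squares_below(m + 1)
        for m in range(k * (k + 1) + 1, bound)
    )
    return tie_at_square and lead_afterwards


print(all(check(k) for k in range(1, 8)))
```

The bound of 2000 is arbitrary; the check only confirms the pattern on a finite range and does not replace the argument in the proof.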

Fact 6.1.2. Every outpacing model satisfies the following:

a. $[2n] \ge [2n + 1]$

b. $[2n + 1] \ge [2n + 2]$

c. If $[2n] > [2n + 1]$, then $[2n + 1] \sim [2n + 2]$.
Proof.

a. By TRICH, it is sufficient to show that $\neg([2n] < [2n + 1])$. If [2n] < [2n + 1], then,
by REP$_<$, there is a y such that $y \sim [2n]$ and $y \subset [2n + 1]$. But any
proper subset of [2n + 1] is outpaced by, and hence smaller than, [2n]. Let
k be the least odd number not in y. Then [2n] leads y at k + 1 and y never
catches up. So there is no y such that $[2n] \sim y \subset [2n + 1]$.

b. Similar to that of (a).

c. Note that $[2n] = [2n + 2] \cup \{0\}$ and that $\mathrm{BASIC} \models (*)$.

(*) $(y < x \land z \le y \land \mathrm{Atom}(z') \land z \subset x \land x = z \cup z') \rightarrow y \sim z$
Let x = [2n], y = [2n + 1], z = [2n + 2] and apply (*).

□ 

The two alternatives left open in 6.1.2 correspond to the possibilities that N
may be odd or even: if $[2n] \sim [2n + 1]$, then N is even; if $[2n + 1] \sim [2n + 2]$,
then N is odd. In section 6.2, we show that both of these possibilities can be
realized in standard models over $\mathcal{P}(N)$. Here, we generalize 6.1.2 to similar
cases, including other congruence classes.

Definition 6.1.3. (x, y) is an alternating pair iff x and y are infinite and
for all i > 0, $x_i < y_i < x_{i+1}$.

Fact 6.1.4. If (x, y) is an alternating pair, then in any outpacing model:

$x \sim y \;\lor\; x > y \sim (x - \{x_1\})$

Proof. The argument for 6.1.2 applies here since the only facts about [2n] and 
[2n + 1] used hold by virtue of these sets forming an alternating pair. □ 

Theorem 6.1.5. For a given k > 0, let $A_i = [kn + i]$ for each i < k. Then:

a. If $0 \le i < j < k$, then $(A_i, A_j)$ is an alternating pair.

b. There is a p, $0 < p \le k$, such that

(i) If i < j < p, then $A_i \sim A_j$.

(ii) If $p \ne k$, then $A_0 \sim A_p \cup \{0\}$.
(iii) If $p \le i < k$, then $A_i \sim A_p$.

(See example below).

Proof.

a. $A_i(n) = kn + i$, $A_j(n) = kn + j$, $A_i(n + 1) = kn + i + k$, and $i < j < k + i$.

b. If $A_0 > A_i$ for some i, let p be the least such i; otherwise, let p = k.

(i) For 0 < i < p, $(A_0, A_i)$ is an alternating pair. So either $A_0 > A_i$ or
$A_0 \sim A_i$ by 6.1.4. But $A_0 \not> A_i$ by the selection of p, so $A_0 \sim A_i$. (i)
follows by TRANS~.

(ii) Immediate from 6.1.4 since $(A_0, A_p)$ is an alternating pair and
$A_0 > A_p$.

(iii) $A_0 > A_p \ge A_i$ if i > p. So $A_0 > A_i$. Hence $A_i \sim A_0 - \{0\} \sim A_p$. So
$A_i \sim A_p$.

□

Example. Let k = 4, so $A_i = [4n + i]$ for i = 0, 1, 2, 3. Then one of the
following situations obtains:

$A_0 \sim A_1 \sim A_2 \sim A_3 > [4n + 4]$
$A_0 > A_1 \sim A_2 \sim A_3 \sim [4n + 4]$
$A_0 \sim A_1 > A_2 \sim A_3 \sim [4n + 4]$
$A_0 \sim A_1 \sim A_2 > A_3 \sim [4n + 4]$
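
That the congruence classes modulo 4 pairwise alternate (in increasing order of remainder) can be confirmed on finite prefixes. This is a minimal sketch: the function names are hypothetical, and Python lists are 0-indexed whereas the definition of an alternating pair indexes elements from 1.

```python
def congruence_class(k, i, length):
    """First `length` members of [kn + i] = {i, k + i, 2k + i, ...}."""
    return [k * n + i for n in range(length)]


def alternates(x, y):
    """Check x_i < y_i < x_{i+1} on finite increasing prefixes of x and y."""
    steps = min(len(x) - 1, len(y))
    return all(x[i] < y[i] < x[i + 1] for i in range(steps))


k = 4
classes = [congruence_class(k, i, 50) for i in range(k)]
pairs_ok = all(
    alternates(classes[i], classes[j]) for i in range(k) for j in range(i + 1, k)
)
print(pairs_ok)
```

The order of the pair matters: $(A_0, A_3)$ alternates, while $(A_3, A_0)$ does not, because the first element of $A_3$ already exceeds the first element of $A_0$.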

6.2 Models of CS and Outpacing 

In this section, we construct models of CS over $\mathcal{P}(N)$ that satisfy OUTPACING.

Outpacing models will be constructed out of finite models of CS by a tech- 
nique which is very much like the ultraproduct construction common in model 
theory though the application here demands some important differences. 

Definition 6.2.1.

a. $L_N$, the language of subsets of N, is the first order language which
results from adding to $L_{\subset <}$, as individual constants, a name for each subset
of N.

b. $\mathcal{A}_n$ is the standard finite interpretation of $L_N$ over $\mathcal{P}(N^{[n]})$ in which
$\mathcal{A}_n(a) = a \cap N^{[n]} = a^{[n]}$ for each $a \subseteq N$.



Definition 6.2.2. (cf. [Monk, Def. 18.15, p. 318]) If X is a set and $F \subseteq \mathcal{P}(X)$,
then

a. F has the finite intersection property iff the intersection of any finite 
subset of F is non-empty. 

b. F is a filter over X iff 

(i) 

(ii) If $a \in F$ and $a \subseteq b$, then $b \in F$, and
(iii) If $a \in F$ and $b \in F$, then $a \cap b \in F$

c. F is an ultrafilter over X iff

(i) F is a filter over X
(ii) $X \in F$, and

(iii) if $Y \subseteq X$, then either $Y \in F$ or $(X - Y) \in F$.

d. An ultrafilter, F, over X is principal iff there is some $x \in X$ such that
$F = \{a \subseteq X \mid x \in a\}$

Fact 6.2.3.

a. A non-principal ultrafilter contains no finite sets. [Bel, Ch. 6, Lemma 1.3,
p. 108]

b. A non-principal ultrafilter over X contains all cofinite subsets of X.

c. If $F \subseteq \mathcal{P}(X)$ and F has the finite intersection property, then there is an
ultrafilter over X which includes F. [Monk, Prop. 18.18, p. 319]

d. If $Y \subseteq X$ and Y is infinite, then there is a non-principal ultrafilter over
X which contains Y.



Definition 6.2.4. If F is an ultrafilter over N, then $\mathcal{A}_F$ is the interpretation
of $L_{\subset <}$ in which

a. the domain of $\mathcal{A}_F$ is $\mathcal{P}(N)$,

b. $\mathcal{A}_F \models a < b$ iff $\{k \mid a^{[k]} < b^{[k]}\} \in F$, and similarly for other predicates.

c. Boolean symbols receive the usual interpretation.

Our main result is that if F is non-principal, then $\mathcal{A}_F$ is an outpacing model.

Theorem 6.2.5. If F is a non-principal ultrafilter over N, then

a. If $\tau$ is a term of $L_N$, then

$\mathcal{A}_k(\tau) = \mathcal{A}_F(\tau) \cap N^{[k]} = \mathcal{A}_F(\tau)^{[k]}$

b. If $\phi$ is a quantifier free formula of $L_N$, then

$\mathcal{A}_F \models \phi$ iff $\{k \mid \mathcal{A}_k \models \phi\} \in F$

c. If $\phi$ is a universal formula of $L_N$ and $\mathcal{A}_k \models \phi$ for every k, then $\mathcal{A}_F \models \phi$.

d. $\mathcal{A}_F \models \mathrm{REP}_<$.

e. $\mathcal{A}_F \models \mathrm{BASIC}$.

f. $\mathcal{A}_F \models \mathrm{ADIV}_n$, for every n.
Proof.

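a. By induction on the structure of $\tau$:

(i) If $\tau$ is a constant, $\tau = a$ for some $a \subseteq N$. So $\mathcal{A}_k(\tau) = a^{[k]}$ by 6.2.1b.

(ii) If $\tau = \ulcorner \tau_1 \cup \tau_2 \urcorner$,

$\mathcal{A}_k(\tau) = \mathcal{A}_k(\tau_1) \cup \mathcal{A}_k(\tau_2)$

$= (\mathcal{A}_F(\tau_1) \cap N^{[k]}) \cup (\mathcal{A}_F(\tau_2) \cap N^{[k]})$
$= (\mathcal{A}_F(\tau_1) \cup \mathcal{A}_F(\tau_2)) \cap N^{[k]}$

$= \mathcal{A}_F(\tau_1 \cup \tau_2) \cap N^{[k]}$

The proofs for intersections and relative complements are similar.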

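b. By induction on the structure of $\phi$:

(i) If $\phi = \ulcorner a \subset b \urcorner$, then $\mathcal{A}_F \models \phi$ iff $a \subset b$.

If $a \subset b$, then there is a k in b but not in a. So if n > k, $a^{[n]} \subset b^{[n]}$.
Hence, $\{n \mid \mathcal{A}_n \models \phi\}$ is cofinite and, by 6.2.3b, in F.
Conversely, if $\{n \mid a^{[n]} \subset b^{[n]}\} \in F$, then it is infinite. So, there
cannot be a k in a but not in b; otherwise $a^{[n]}$ would not be included
in $b^{[n]}$ for any n greater than k. So $a \subseteq b$. But, clearly, $a \ne b$, so
$a \subset b$.

(ii) If $\phi = \ulcorner a = b \urcorner$, then

$\mathcal{A}_F \models \phi$ iff a = b

iff $a^{[n]} = b^{[n]}$ for all n
iff $\{n \mid \mathcal{A}_n \models a = b\} = N$
iff $\{n \mid \mathcal{A}_n \models a = b\} \in F$

since if $\mathcal{A}_n \not\models a = b$ and k > n, then $\mathcal{A}_k \not\models a = b$.

(iii) $\mathcal{A}_F \models a < b$ iff $\{k \mid \mathcal{A}_k \models (a < b)\} \in F$. Immediate from 6.2.4b.

(iv) If $\phi$ is non-atomic, then, since F is an ultrafilter:

$\mathcal{A}_F \models \phi_1 \land \phi_2$ iff $\mathcal{A}_F \models \phi_1$ and $\mathcal{A}_F \models \phi_2$

iff $\{k \mid \mathcal{A}_k \models \phi_1\} \in F$ and $\{k \mid \mathcal{A}_k \models \phi_2\} \in F$
iff $\{k \mid \mathcal{A}_k \models \phi_1 \land \phi_2\} \in F$

$\mathcal{A}_F \models \neg\phi$ iff $\mathcal{A}_F \not\models \phi$

iff $\{k \mid \mathcal{A}_k \models \phi\} \notin F$
iff $\{k \mid \mathcal{A}_k \models \neg\phi\} \in F$

c. Suppose $\mathcal{A}_k \models \forall x\,\phi(x)$, for all k.

Then $\mathcal{A}_k \models \phi(a)$, for all a, for all k.
So $\mathcal{A}_k \models \phi(a)$, for all k, for all a.

So $\mathcal{A}_F \models \phi(a)$, for all a, by (b).

So $\mathcal{A}_F \models \forall x\,\phi(x)$.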

d. Suppose $\mathcal{A}_F \models (a < b)$. Construct a', a subset of b, for which $\mathcal{A}_F \models (a \sim a')$,
as follows:

Let $K = \{k \mid a^{[k]} < b^{[k]}\}$, so $K \in F$.

$K = \{k_1, \dots, k_i, \dots\}$, where the $k_i$'s are in strictly increasing order.
Let $a_0 = \emptyset$,

$a_{i+1} = a_i \cup \{\text{the } n \text{ greatest members of } b^{[k_{i+1}]} \text{ not in } a_i\}$,
where $n = |a^{[k_{i+1}]}| - |a_i|$,
$a' = \bigcup_i a_i$.

Then $a' \subset b$ since each $a_i$ draws its new members from b.
Claim: If $k \in K$, then $a'^{[k]} \sim a^{[k]}$.

Hence $\mathcal{A}_F \models a' \sim a$, since they are the same size over some set which
contains K and is, thus, in F.

e. Immediate from (c) and (d): BASIC is equivalent to a set of universal
sentences, together with ATOM and REP$_<$. ($\mathcal{A}_F \models \mathrm{ATOM}$ because it
contains all singletons of natural numbers.)

f. For $0 < i \le n$ and x an infinite subset of N, let $x_i = \{x_{kn+i-1} \mid k \in N\}$.

The n sets, $x_i$, partition x. Furthermore, these form an alternating
n-tuple, in the manner of the congruence classes modulo n (see Theorem
6.1.5). As in 6.1.5, these sets are approximately equal in size and
$\mathcal{A}_F \models \mathrm{ADIV}_n(x)$.

□

Corollary 6.2.6. If F is a non-principal ultrafilter over N, then

a. $\mathcal{A}_F \models \mathrm{CAI}$, and

b. $\mathcal{A}_F \models \mathrm{CSI}$
Proof. 

a. Immediate from 6.2.5e and 6.2.5f. 

b. Immediate from (a) and 5.3.3b. 

□ 

The proof of 6.2.6 is modeled on the usual ultraproduct construction, but
is not quite the same. In the usual construction (see, for example, [Bel, pp.
87-92]), a model is built by first taking the product of all factors (in this case,
the $\mathcal{A}_k$), which results in a domain whose elements are functions from the index
set (N, here) to elements of the factors. These functions are then gathered
into equivalence classes (by virtue of agreeing almost everywhere, i.e. on some
member of the filter) and the reduced ultraproduct is defined by interpreting
the language over these equivalence classes. The model so constructed, which
we'll call $\prod \mathcal{A}_i/F$, has the handy property that it satisfies any formula which is
satisfied by almost all factors, and certainly any formula which is satisfied in all
the factors. This is handy because, given that each of the $\mathcal{A}_k$ satisfies CS, we
can immediately conclude that $\prod \mathcal{A}_i/F$ satisfies CS.

Unfortunately, $\prod \mathcal{A}_i/F$ is not the model we wanted: its elements are not
subsets of N, but equivalence classes of functions from N to finite subsets of
N. There is, indeed, a natural mapping from subsets of N to such elements,
and this mapping would have allowed us to identify a model over $\mathcal{P}(N)$ as a
submodel of $\prod \mathcal{A}_i/F$; but only a submodel. So, had we constructed the reduced
ultraproduct, we would have then been able to infer that the part of that model
which held our interest satisfied all universal formulas of CS; we still would
have had to resort to special means to show that the non-universal formulas
were likewise satisfied.

Fortunately, these special means were available; the only non-universal
axioms of CAI could be verified in the constructed model more or less directly, and
the completeness proof of the last chapter allowed us to infer that all formulas
true in all of the factors are true in the model $\mathcal{A}_F$, after all.

For a given F, there will be many cases where $\mathcal{A}_F \models a < b$ even though
b does not outpace a. This will happen whenever $\{k \mid a^{[k]} < b^{[k]}\} \in F$ but is
not cofinite. Consider a familiar example: Let a = [2n + 1] and b = [2n]. Then
$a^{[k]} < b^{[k]}$ iff $k \in [2n]$, for if we count the even numbers and the odd numbers
up to some even number, there will always be one more even and if we count up
to some odd number, there will always be the same number of evens and odds.
[2n] is neither finite nor cofinite, so it may or may not be in F. If $[2n] \in F$,
$\mathcal{A}_F \models [2n + 1] < [2n]$. Otherwise, $[2n + 1] \in F$, so $\mathcal{A}_F \models [2n + 1] \sim [2n]$.
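
The index set in this example can be computed directly on a finite range. The sketch below is illustrative only: the names are hypothetical and $a^{[k]}$ is taken to be $a \cap \{0, \dots, k\}$. It collects the k at which the odds are strictly behind the evens and the k at which the two prefixes tie.

```python
def prefix_size(member, k):
    """Size of a^[k] = a ∩ {0, ..., k}, with a given as a membership test."""
    return sum(1 for i in range(k + 1) if member(i))


def is_even(i):
    return i % 2 == 0


def is_odd(i):
    return i % 2 == 1


horizon = 200
odds_behind = [
    k for k in range(horizon + 1)
    if prefix_size(is_odd, k) < prefix_size(is_even, k)
]
ties = [
    k for k in range(horizon + 1)
    if prefix_size(is_odd, k) == prefix_size(is_even, k)
]
print(odds_behind[:5], ties[:5])
```

On this range the first list is exactly the even numbers and the second exactly the odd numbers, so which of the two comparisons holds in $\mathcal{A}_F$ depends on which of these two sets belongs to F.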

The construction of a model $\mathcal{A}_F$ from any non-principal ultrafilter, F,
suggests that there are many outpacing models unless different ultrafilters can yield
the same model. We will first show that this qualification is not needed.

Theorem 6.2.7. If $F_1$ and $F_2$ are distinct non-principal ultrafilters over N,
then $\mathcal{A}_{F_1} \ne \mathcal{A}_{F_2}$.

Proof. (See below.) □

To show this, we will show that the presence of a set in an ultrafilter makes a
direct, personalized contribution to the model $\mathcal{A}_F$. Putting this in another way,
there is a set of decisions that must be made in constructing an outpacing model;
each decision may go either way, though the decisions are not independent of
each other. Furthermore, each decision is made for a model $\mathcal{A}_F$ by the presence
or absence of a particular set in F.

Definition 6.2.8. If $x \subseteq N$, then $x^+ = \{i + 1 \mid i \in x\}$.

A pair of sets, $(x, x^+)$, can sometimes be an alternating pair, but this is not
always the case.

Fact 6.2.9. $(x, x^+)$ is an alternating pair iff x is infinite and there is no $n \in x$
such that $(n + 1) \in x$. That is, no two consecutive numbers are in x.

Nevertheless, pairs $(x, x^+)$ are like alternating pairs in the following way:

Lemma 6.2.10. If x is infinite, $x_1 = (x - x^+)$, $x_2 = (x^+ - x)$, F is a
non-principal ultrafilter, and $\mathcal{A} = \mathcal{A}_F$, then

a. $(x_1, x_2)$ is an alternating pair.

b. $\mathcal{A} \models x \sim x^+$ or $\mathcal{A} \models x > x^+ \sim x - \{x(1)\}$

c. $\mathcal{A} \models x > x^+$ iff $x \in F$.
Proof.

a. Let a run of x be a maximal consecutive subset of x. (So [2n] has only
1-membered runs, while N - [10n + 1] has only 9-membered runs.) So, $x_1(n)$
is the first element in the n-th run of x and $x_2(n)$ is the first element after
the n-th run of x.

b. We know from (a) that $x_2 \sim x_1$ or $x_2 \sim x_1 - \{x_1(1)\}$. But the disjoint union
of $x \cap x^+$ with $x_1$ or $x_2$, respectively, yields x or $x^+$. So (b) follows from
DISJ$\cup$.

c. We need only prove (*):

(*) $|x^{[n]}| > |(x^+)^{[n]}|$ iff $n \in x$

Informally: $x^{[n]}$ first becomes greater than $(x^+)^{[n]}$ for n = x(1), since $x(1) \notin x^+$
because $x(1) - 1 \notin x$. Throughout the first run of x, x maintains
its lead, losing it only at the least n for which $n \notin x$ (since $(n - 1) \in x$, so
$n \in x^+$). This pattern repeats during successive runs of x.

□ 
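
The pattern described informally in (c) can be tested on a sample set. This is a sketch under stated assumptions: x is truncated to the numbers below 300, $x^{[n]}$ is read as $x \cap \{0, \dots, n\}$, and the particular choice of x (residues 0, 1, 2 and 5 modulo 7, giving runs of length 3 and 1) is arbitrary.

```python
def shift(x):
    """x+ = {i + 1 | i in x}, for a finite set x."""
    return {i + 1 for i in x}


def prefix_size(s, n):
    """Number of members of s that are less than or equal to n."""
    return sum(1 for k in s if k <= n)


limit = 300
x = {n for n in range(limit) if n % 7 in (0, 1, 2, 5)}
x_plus = shift(x)

# (*): x leads x+ on {0, ..., n} exactly when n itself belongs to x.
star_holds = all(
    (prefix_size(x, n) > prefix_size(x_plus, n)) == (n in x)
    for n in range(limit)
)
print(star_holds)
```

The check succeeds for any x, since the members of $x^+$ up to n are the successors of the members of x up to n - 1, so the two prefix sizes differ by one exactly when n is in x.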

We can now prove our main result, Theorem 6.2.7:

Proof. Without loss of generality, we can suppose there is a set, x, such that
$x \in F_1$ and $x \notin F_2$. By 6.2.10c, $\mathcal{A}_{F_1} \models x > x^+$ and $\mathcal{A}_{F_2} \not\models x > x^+$. □

Theorem 6.2.7 allows us to improve upon some previous results. For example,
we can show that either of the alternatives in 6.2.10b can be obtained for any
alternating pair.

Theorem 6.2.11. 

a. If neither x nor y outpaces the other, then there is an ultrafilter F such
that $\mathcal{A}_F \models (x \sim y)$.

b. If (x, y) is an alternating pair, then there is an ultrafilter F such that
$\mathcal{A}_F \models (x > y)$.

Proof.

a. Let $J = \{k \mid |x^{[k]}| = |y^{[k]}|\}$. Since neither x nor y outpaces the other, J
is infinite. By 6.2.3d, let F be a non-principal ultrafilter which contains
J. Then $\mathcal{A}_F \models x \sim y$.

b. If (x, y) is an alternating pair, so is $(y, x - \{x(1)\})$. So, by (a) there is an F
such that $\mathcal{A}_F \models y \sim (x - \{x(1)\})$. But then $\mathcal{A}_F \models x > y$.

□ 



Theorem 6.2.12. Every infinite completion of CS has an outpacing model.

Proof. Recall that every infinite completion of CS is equivalent to $\mathrm{CSI}_f$ for some
total and congruous remainder function, f. (See 3.6.2.)

Given such an f, let $G_k = [kn + f(k) + 1]$ for each k > 0. Then (*) holds
for each k:

(*) $G_k = \{n \mid \mathcal{A}_n \models \mathrm{MOD}_{k,f(k)}\}$

Let $G = \{G_k \mid k > 0\}$.

The intersection of any finite subset, H, of G is infinite. If H is a finite
subset of G, then $H = \{G_k \mid k \in J\}$, where J is some finite subset of $N^+$. So
$\bigcap H = \{n + 1 \mid k \in J \rightarrow n \equiv f(k) \bmod k\}$. But f is congruous, so the restriction
of f to the finite domain J has infinitely many solutions (see 3.3.8a).

Since the intersection of any finite subset of G is infinite, there is a
non-principal ultrafilter F such that $G \subseteq F$. By (*), $\mathcal{A}_F \models \mathrm{MOD}_{k,f(k)}$, so
$\mathcal{A}_F \models \mathrm{CSI}_f$. □
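
The step from a finite family of congruences to infinitely many solutions can be illustrated by brute force. The sketch below rests on an assumption: it treats "congruous" as pairwise compatibility, $f(j) \equiv f(k) \pmod{\gcd(j, k)}$, which is the usual solvability condition for simultaneous congruences and may not be worded the same way as the definition given earlier in the text. The names `compatible` and `solutions` and the sample remainder function are hypothetical.

```python
from math import gcd


def compatible(f):
    """Pairwise test f(j) ≡ f(k) (mod gcd(j, k)); a stand-in for 'congruous'."""
    moduli = list(f)
    return all((f[j] - f[k]) % gcd(j, k) == 0 for j in moduli for k in moduli)


def solutions(f, bound):
    """All n < bound with n ≡ f(k) (mod k) for every modulus k in f."""
    return [n for n in range(bound) if all(n % k == f[k] % k for k in f)]


f = {2: 1, 3: 2, 4: 3}       # a sample finite restriction of a remainder function
print(compatible(f), solutions(f, 50))

g = {2: 0, 4: 1}             # incompatible: even, yet congruent to 1 mod 4
print(compatible(g), solutions(g, 100))
```

For the compatible family the solutions recur with period lcm(2, 3, 4) = 12, so there are infinitely many; shifting each by one, as in the description of $\bigcap H$, does not affect that.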
On the basis of 6.2.11 we noted that there are even and odd outpacing
models; we can now extend that observation to moduli other than 2. More
specifically, all of the possibilities listed in 6.1.5 for the relative sizes of the
k-congruence classes are obtainable in outpacing models.

This section has explored the existence and variety of outpacing models.
Three comments are in order before we turn to the common structure of
outpacing models.

First, even if T is an infinite completion of CS, there is no unique outpacing
model which satisfies T. This would be true only if fixing the congruence classes
determined whether $x \sim x^+$ or $x > x^+$ for every $x \subseteq N$. That all such choices
are not determined by a remainder theory can be seen intuitively, perhaps, by
considering x = [$n^2$]: any finite set of congruences has infinitely many solutions
that are squares and infinitely many that are not; so whether $x \in F$ is an
independent choice. Also, there are only $2^{\omega}$ remainder functions while there are
$2^{2^{\omega}}$ non-principal ultrafilters over N [Bel, Ch. 6, Theorem 1.5], each yielding a
different outpacing model.

Second, it is not clear whether every outpacing model can be obtained by the
construction of 6.2.4. Lemma 6.2.10c may suggest that any outpacing model,
$\mathcal{A}$, is $\mathcal{A}_F$ for $F_{\mathcal{A}} = \{x \mid \mathcal{A} \models x > x^+\}$, but it should not. To establish that
$\mathcal{A} = \mathcal{A}_F$, both (1) and (2) are necessary.

(1) If $\mathcal{A}$ is an outpacing model, then $F_{\mathcal{A}}$ is a non-principal
ultrafilter.

(2) If $F_{\mathcal{A}} = F_{\mathcal{B}}$, then $\mathcal{A} = \mathcal{B}$

I have not been able to prove (1) or (2). If (1) is false, then clearly $\mathcal{A} \ne \mathcal{A}_F$.
But even if (1) is true, two outpacing models may agree about all pairs $(x, x^+)$
but disagree elsewhere. At most one of them is obtainable by our construction.
So (1) and (2) are open problems.

Finally, though it may be extraneous to show that OUTPACING is inde- 
pendent, we will do so. 

Theorem 6.2.13. There are standard models of CS over $\mathcal{P}(N)$ which do not
satisfy OUTPACING.

Proof. Suppose that <' is a linear ordering under which N forms an $\omega$-sequence.
Then we could define x <'-outpaces y as follows:

$\exists n\,\forall m\,[\,n <' m \rightarrow |\{k \mid k \in x \land k <' m\}| > |\{k \mid k \in y \land k <' m\}|\,]$

and define the principle:

OUTPACING$_{<'}$. If x <'-outpaces y, then x > y.

Modifying 6.2.4, we could produce standard models of CS over $\mathcal{P}(N)$ which
satisfy OUTPACING$_{<'}$ and these will not, in general, satisfy OUTPACING.
Suppose, for example, that <' is the ordering:

$p_1, q_1, \dots, p_k, q_k, \dots$

where p is the set of prime numbers and q is its complement. In any model
of OUTPACING$_{<'}$, p and q will be nearly the same size, as the evens and the
odds are in normal outpacing models. But the evens are much smaller than q,
so p > [2n] and OUTPACING is false in OUTPACING$_{<'}$ models. □

6.3 Size and density 

When number theorists talk about the sizes of sets of natural numbers, they do 
not content themselves with speaking of the (Cantorian) cardinalities of these 
sets. Since they often want to compare infinite subsets of N, they need a more 
discriminating notion. 

One notion they use is asymptotic density. The asymptotic density of a
set, x, of natural numbers is the limit, if there is one, of $|x^{[n]}|/n$ as n grows. For
example, the asymptotic density of [2n] is 1/2. From now on we shall use the
term 'density' for asymptotic density.

In this section, we compare the ordering of $\mathcal{P}(N)$ given by density to the
orderings given by CS and OUTPACING.

Definition 6.3.1.

a. $x^i = |x^{[i]}|/i$, the fraction of numbers less than or equal to i that are members
of x.

b. If $x \subseteq y \ne \emptyset$, then p(x, y), the density of x in y, is the limit, if it exists,

$\lim_{i \to \infty} x^i / y^i$

That is,

p(x, y) = r iff $(\forall \delta > 0)\,\exists n\,(\forall i > n)\,(r - \delta < x^i / y^i < r + \delta)$

c. The density of x, p(x), is the density of x in N, if x has a density in
N.

d. If $x \subseteq y \ne \emptyset$, then x converges in y, cvg(x, y), iff x has a density in y;
otherwise, x diverges in y.

e. x converges, cvg(x), iff x converges in N. x diverges iff x diverges in
N.



Fact 6.3.2.

a. If x is finite, p(x) = 0.

b. If x is cofinite, p(x) = 1.

c. p([2n]) = 1/2
p([4n], [2n]) = 1/2
p([4n]) = 1/4

d. If cvg(x,y) and cvg(y,z), then cvg(x,z) and p(x,z) = p(x,y)p(y,z).
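
The values in (c) and the product rule in (d) can be approximated at a finite cutoff. The following sketch is only indicative of the limits: the function name and the cutoff are assumptions of this example, sets are encoded as membership tests, and the ratio $x^i / y^i$ is computed as a ratio of prefix counts, since the common normalising factor cancels.

```python
def fraction_in(x, y, i):
    """|x ∩ y ∩ {0..i}| / |y ∩ {0..i}|, a finite approximation to p(x, y)."""
    x_count = sum(1 for k in range(i + 1) if x(k) and y(k))
    y_count = sum(1 for k in range(i + 1) if y(k))
    return x_count / y_count


naturals = lambda k: True
evens = lambda k: k % 2 == 0
fours = lambda k: k % 4 == 0

cutoff = 100_000
fours_in_evens = fraction_in(fours, evens, cutoff)
evens_in_n = fraction_in(evens, naturals, cutoff)
fours_in_n = fraction_in(fours, naturals, cutoff)
print(fours_in_evens, evens_in_n, fours_in_n)
```

At any cutoff the third value is the product of the first two, because the intermediate count cancels; the product rule for densities is the limiting form of that identity.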

Fact 6.3.3.

a. There are divergent sets.

b. If 0 < r < 1, there is a set with density r.

c. If 0 < r < 1 and y is infinite, there is a set with density r in y.
Proof.

a. Let $x = \{i \mid \exists n\,(10^{2n} \le i < 10^{2n+1})\}$. So x contains all numbers between
1 and 9, between 100 and 999, between 10,000 and 99,999, and so forth.
If n > 1, then $x^{10^{2n}} < .1$ and $x^{10^{2n+1}} > .9$. So $x^k$ cannot have a limit.

b. Suppose r is given. Construct the set x as follows:

$x_{i+1} = \begin{cases} x_i, & \text{if } (x_i)^i > r; \\ x_i \cup \{i + 1\}, & \text{otherwise.} \end{cases}$

$x = \bigcup_i x_i$

c. Modify the construction for (b) in the obvious ways.

□ 



Theorem 6.3.4. Suppose that both x and y converge in z. Then, if p(x,z) <
p(y,z), y outpaces x.

Proof.

Let $b = (p(y,z) - p(x,z))/3$.

Let $n_1$ = the least n such that for $i \ge n$, $-b < x^i/z^i - p(x,z) < b$,

and let $n_2$ = the least n such that for $i \ge n$, $-b < y^i/z^i - p(y,z) < b$.

So for any $i > n_1 + n_2$, we have $x^i/z^i < p(x,z) + b$

and $y^i/z^i > p(y,z) - b$.

But $p(x,z) + b < p(y,z) - b$.

So $x^i/z^i < y^i/z^i$.

So $x^i < y^i$.

So $|x^{[i]}| < |y^{[i]}|$.

□



We can use the relation between density and outpacing to draw conclusions 
about densities that have nothing to do with outpacing, as in Theorem 6.3.5. 

Theorem 6.3.5. If $p(x, z_1) < p(y, z_1)$, and both x and y converge in $z_2$, then
$p(x, z_2) \le p(y, z_2)$.

Proof. Since $p(x, z_1) < p(y, z_1)$, y outpaces x. But if $p(x, z_2) > p(y, z_2)$, then x
outpaces y. So, $p(x, z_2) \le p(y, z_2)$. □

We cannot strengthen the consequent of 6.3.5 to say that $p(x, z_2) < p(y, z_2)$:
Let $z_1$ be the set of primes, let x contain every third member of $z_1$, and let y
be $z_1 - x$. Then $p(x, z_1) = 1/3$ and $p(y, z_1) = 2/3$, but p(x, N) = p(y, N) = 0.
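
The counterexample can be inspected numerically, with the caveat that the densities in N tend to 0 only slowly, so a finite cutoff shows small values rather than 0. This sketch uses a basic sieve; the function name and the cutoff are choices made for this illustration.

```python
def primes_up_to(limit):
    """Sieve of Eratosthenes: all primes less than or equal to limit."""
    sieve = [True] * (limit + 1)
    sieve[0:2] = [False, False]
    for p in range(2, int(limit ** 0.5) + 1):
        if sieve[p]:
            sieve[p * p :: p] = [False] * len(range(p * p, limit + 1, p))
    return [n for n, is_prime in enumerate(sieve) if is_prime]


limit = 100_000
z1 = primes_up_to(limit)
x = z1[2::3]                                       # every third prime
y = [p for i, p in enumerate(z1) if i % 3 != 2]    # the remaining primes

print(len(x) / len(z1), len(y) / len(z1))  # relative to the primes
print(len(x) / limit, len(y) / limit)      # relative to N, at this cutoff
```

Relative to the primes the two fractions sit near 1/3 and 2/3, while relative to N both are already small and keep shrinking as the cutoff grows.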

Theorem 6.3.4 implies that in any outpacing model, sets with distinct
densities will have distinct sizes. Even if two sets have the same density, they will
differ in size if they have different densities in some common set. So, from Facts
6.3.3b and 6.3.3c, we can begin to appreciate how precise an ordering outpacing
models provide:

Fact 6.3.6. If $\mathcal{A}$ is an outpacing model, then

a. there are uncountably many sizes of sets in $\mathcal{A}$, and

b. if 0 < r < 1, then even among sets with density r, there are uncountably
many sizes in $\mathcal{A}$.

Proof.

a. Immediate from Fact 6.3.3b and Theorem 6.3.4.

b. Let x be an infinite set with density r, let y be an infinite subset of x,
where p(y, x) = 0, and let z = x - y. There are uncountably many subsets
of y with distinct densities in y, though $p(y_1, x) = 0$ for any $y_1 \subseteq y$.
Suppose now that $y_1 \subseteq y$, $y_2 \subseteq y$, and $p(y_1, y) < p(y_2, y)$. Then $\mathcal{A} \models y_1 < y_2$,
by 6.3.4, so $\mathcal{A} \models z \cup y_1 < z \cup y_2$, by DISJ$\cup$.

But $p(z \cup y_1) = p(z \cup y_2) = r$ since z was obtained by removing from
x a set with density 0 relative to x.

□ 

6.3.1 The Convexity Problem 

It's tempting to infer from these results that the extremely fine ordering of sets 
by size (or, rather, any such ordering which is realized in an outpacing model) 
is both a refinement and a completion of the ordering suggested by density: a 
refinement because it preserves all differences in size which are captured by the 
notion of density, a completion because all sets are located in a single, linear 
ordering of sizes. 

But the situation is really not so clear. It is evident that the size ordering
over $\mathcal{P}(N)$ in any outpacing model is a refinement of the ordering by cardinality: if
x has a smaller cardinal number than y, then x is smaller than y, though two
sets with the same cardinal number may have different sizes. We can regard
the cardinality of a set in $\mathcal{P}(N)$ as determined by its size, though different sizes
may yield the same cardinal number. We shall express this fact by saying that,
at least when we focus on $\mathcal{P}(N)$, cardinality is a function of size. (It is not at
all clear that this is true in any power set.)

Now, we want to know whether the density of a set is a function of its 
size. It turns out that a negative answer is compatible with our theory (CS 
and OUTPACING), while an affirmative answer may or may not be consistent. 
First, we will give a more precise formulation of this problem; second, we will 
show that a negative answer is consistent; finally, we'll discuss the consistency of 
an affirmative answer. In passing, we'll explain why this is called the convexity 
problem. 

Consider (1) and (la): 

(1) If x ~ y and p(x) = r, then p(y) = r. 

(la) If x ~ y and p(x) = r and cvg(y), then p(y) = r. 

(la) is an immediate consequence of Theorem 6.3.4. For if y converges, there is 
some r 2 = p(y); if r < r 2 , then x < y and if r > r 2 , then x > y. But x ~ y, so 
r 2 = r. 

So, the questionable part of (1) can be expressed as (2): 

(2) If x ~ y and x converges, then y converges. 

Theorem 6.3.4 ensures that (1) holds just in case (2) does. Recalling that sets
may have the same density even though they differ in size, we may consider two
additional formulations:

(3) If p(x) = p(y) and x < z < y, then p(z) = p(x).

(4) If p(x) = p(y) and x < z < y, then z converges.

(3) and (4) are equivalent for the same reasons that (1) and (2) are equivalent. 

Fact 6.3.7. An outpacing model satisfies (1) iff it satisfies (3). 

Proof.

⇒ Suppose that x < y < z and p(x) = p(z).

By REP<, there are two sets, x' and y', for which

x' ⊂ y' ⊂ z, x ~ x', y ~ y'.

Since x ~ x' and p(x) = p(z), p(x') = p(z), by (1).

So p(z - x') = 0

and p(z - y') = 0.

But y' = z - (z - y').

So p(y') = p(z).

So p(y) = p(z), by (1).

⇐ Suppose that x ~ y and p(x) = r.

If x = ∅ or x = N, then y = x, so p(y) = p(x).

To apply (3), we need to find two sets, x_1 and x_2, such that

p(x_1) = p(x_2) = r = p(x)

and

x_1 < y < x_2.

Assuming that x ≠ ∅ and x ≠ N, let

x_1 = x - a, for some a ∈ x

and

x_2 = x ∪ b, for some b ∉ x.

Then x_1 < x < x_2, by SUBSET, and x_1 < y < x_2, by INDISC~.

Since adding or removing a single element has no effect on the density of
a set,

p(x_1) = p(x) = p(x_2).

So, by (3), p(y) = r.

□
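The step in the proof that relies on density being unaffected by a single
element can be checked numerically. The following is only an illustrative
sketch: it assumes that density is approximated by the relative frequency of a
set among 0, ..., n for a large n, and the helper name `relative_frequency` is
hypothetical rather than notation from the text.

```python
def relative_frequency(contains, n):
    """Fraction of the numbers 0..n that belong to the set described by `contains`."""
    return sum(1 for k in range(n + 1) if contains(k)) / (n + 1)

def evens(k):
    # x: the even numbers
    return k % 2 == 0

def evens_minus_zero(k):
    # x_1 = x - a, with a = 0 removed from x
    return evens(k) and k != 0

def evens_plus_one(k):
    # x_2 = x U b, with b = 1 added to x
    return evens(k) or k == 1

N = 100_000
freqs = [relative_frequency(s, N) for s in (evens_minus_zero, evens, evens_plus_one)]
# The three counts differ by at most 2, so the spread is about 2 / (N + 1).
spread = max(freqs) - min(freqs)
print(freqs, spread)
```

As N grows the spread shrinks toward 0, so all three sets approach the same
limiting frequency, which is what the proof needs of x_1, x and x_2.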

The question at hand is called the 'convexity problem' because of the
formulation in (3). In geometry, a figure is convex if, given any two points in
the figure, any point between them (i.e., on the line segment from one to the
other) is also in the figure. Applying this notion in the obvious way to the size
ordering, (3) says that the class of sets having a given density is convex. By
Theorems 6.3.4 and 6.3.7, (1), (2) and (4) say the same thing.

We regret that the only thing we know about (3) is that it may be false: 

Theorem 6.3.8. The negation of (1) is satisfied in some outpacing model. 

Proof. Let x be a set with density r and let y be a divergent set which neither
outpaces nor is outpaced by x. Let K be

{ n | x[n] = y[n] }

K is infinite, so K is a member of some non-principal ultrafilter, F. A_F ⊨ x ~ y,
so (1) is false in A_F. □
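The set K in this proof can be explored for a concrete pair of sets. The sketch
below is an illustration under stated assumptions, not part of the proof: it
reads the condition defining K as saying that the two initial segments have
equally many members, it takes x to be the even numbers, and it takes y to be a
hypothetical divergent set built from blocks of doubling length. The ultrafilter
step itself is not modeled.

```python
def in_x(n):
    # x: the even numbers, which have density 1/2
    return n % 2 == 0

def in_y(n):
    # y: the numbers in blocks [2^b - 1, 2^(b+1) - 2] with b even. The blocks
    # double in length, so the relative frequency of y keeps swinging between
    # roughly 1/3 and 2/3 and never settles.
    return ((n + 1).bit_length() - 1) % 2 == 0

def equal_count_points(limit):
    """All n <= limit at which x and y have equally many members <= n."""
    K, count_x, count_y = [], 0, 0
    for n in range(limit + 1):
        count_x += in_x(n)
        count_y += in_y(n)
        if count_x == count_y:
            K.append(n)
    return K

K = equal_count_points(100)
print(K)
```

For this pair the two counts overtake each other again and again, and since
each count grows by at most one per step, they must coincide at every
crossing. That is the sense in which neither set outpaces the other while K
stays infinite.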

Open Problem: Is (1) consistent with CS and OUTPACING? 
A Notation

A.1 Predicate Logic

The formal theories discussed in this thesis are theories with standard
formalization, in the sense of [Tarski, p. 5]. That is, they are formalized in
first order predicate logic with identity and function symbols. The following
notation is used for the predicate calculus:



∧ and

∨ or

¬ not

→ if ... then

↔ if and only if

= identical

≠ not identical

∃x there is an x

∀x for all x



Conjunctions and disjunctions of sets of sentences are represented by:

⋀ and ⋁



A first order language is determined by its non-logical constant symbols, 
in the usual way. These may be predicates, individual constants, or function 
symbols. In most cases, the ranks of the symbols will accord with their familiar 
uses and we do not stipulate them. 

A schematic function is a function whose range is a set of formulae. DIV_n,
for example, maps natural numbers into sentences (see 3.2.8). The arguments
of such functions are indicated by subscripts.
A.2 Set Theory

For first order languages with boolean operations and predicates, we use the
following notation:

I the universe

∅ the empty set

x ∪ y the union of x and y

x ∩ y the intersection of x and y

x - y the relative complement of y in x

x ⊆ y x is contained in y

x ⊂ y x is a proper subset of y

These symbols are also used for set theoretic relations, outside of first order 
languages. 

In addition, we use the following: 



x ∈ y x is a member of y

P(x) the power set of x

{x | φ(x)} the set of x's which are φ

x; a the union of x and a

x[n] the members of x less than or equal to n

N the set of natural numbers: 0, 1, ...

N+ N - {0}

Z the set of integers

ω the smallest infinite cardinal

2^ω 2 to the ω


A.3 Arithmetic

For arithmetic, we use the following:

i + j i plus j

i - j i minus j

ij i times j

i^j i to the j-th power

i! i factorial

i | j i divides j

gcd(i, j) greatest common divisor of i and j

n ≡ m mod j n is congruent to m modulo j

We use the following notation for sets of natural numbers:

[...n...] = {k | ∃n (k = ...n...)}

For example:

[kn + j] = the set of numbers congruent to j modulo k
[n^2] = the set of squares of natural numbers
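The bracket notation can be made concrete by listing the members of such a set
up to a bound. This is a minimal sketch; the function name `bracket` and the
cut-off `limit` are assumptions introduced for illustration, and the search
relies on the term being at least as large as n, which holds for both examples
above.

```python
def bracket(term, limit):
    """Members of [term(n)] that are <= limit.

    Assumes term(n) >= n for every natural number n, so trying n = 0..limit
    cannot miss a member that is <= limit.
    """
    return sorted({term(n) for n in range(limit + 1) if term(n) <= limit})

k, j = 3, 1
residues = bracket(lambda n: k * n + j, 20)   # [kn + j] with k = 3, j = 1
squares = bracket(lambda n: n ** 2, 50)       # [n^2]
print(residues)
print(squares)
```

With k = 3 and j = 1 the first list holds the numbers up to 20 that are
congruent to 1 modulo 3, and the second holds the squares up to 50.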



B Model Theory 

This section lists the model-theoretic notions and results assumed in the text 
and presents our notation for these notions. 

An interpretation, A, of a first order language, L, consists of a domain, A,
and a function which assigns to each individual constant of L a member of A,
to each n-place predicate of L, a set of n-tuples over A, and to each n-argument
function symbol of L, an n-ary function defined over and yielding values in A.

We assume the notion of satisfaction as defined in [Chang, section 1.3] and
use A ⊨ φ to mean that A satisfies φ.

Definition B.1. Familiar notions about models.

a. A is a submodel of B (A ⊆ B).

b. B is an extension of A (A ⊆ B).

c. The submodel of B generated by X, where X is a subset of B.

d. A is isomorphic to B (A ≅ B).

e. f is an isomorphic embedding of A into B.

f. B is an elementary extension of A.

g. {A_i} is a chain of models.

h. A = ⋃{A_i}; the union of a chain of models.



Definition B.2. Notions about theories: 

a. A theory is a set of first-order sentences.

b. The language of a theory, L_T.

c. T proves φ (T ⊢ φ);
T proves T_2 (T ⊢ T_2).

d. T is complete.

e. T is consistent.

f. T is categorical.

g. T is equivalent to T_2 (T ≡ T_2).

Fact B.3. The following are well known facts: 

a. If T ⊢ φ, then some finite subset of T proves φ (compactness).

b. If T is complete and T; φ is consistent, then T ⊢ φ.

c. If T is categorical, then T is complete.

Definition B.4. More familiar notions: 

a. T is existential iff it is equivalent to a set of prenex sentences, none of 
which have universal quantifiers. 

b. T is universal iff it is equivalent to a set of prenex sentences, none of 
which have existential quantifiers. 

c. T is universal-existential iff it is equivalent to a set of prenex
sentences, each of which has all of its universal quantifiers preceding any of
its existential quantifiers.

d. φ is a primitive formula iff φ is an existential formula in prenex form
whose matrix is a conjunction of atomic formulas and negations of atomic
formulas.

Fact B.5. Fairly familiar facts: 

a. If T is existential, A ⊨ T, and A ⊆ B, then B ⊨ T.

b. If T is universal and A ⊨ T, then any submodel of A also satisfies T.

c. If T is universal-existential and A_i ⊨ T, for all i > 0, then ⋃{A_i} ⊨ T.
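Parts a and b can be illustrated on a small finite structure. The sketch below
is an example only, under assumptions chosen for illustration: the language has
a single binary predicate interpreted as divisibility, so every nonempty subset
of the domain is a submodel, and the two sentences and all function names are
hypothetical.

```python
from itertools import combinations

def divides(a, b):
    return b % a == 0

def is_chain(domain):
    # Universal sentence: for all a, b: a divides b or b divides a
    return all(divides(a, b) or divides(b, a) for a in domain for b in domain)

def has_two_elements(domain):
    # Existential sentence: there are a, b with a != b
    return any(a != b for a in domain for b in domain)

model = {1, 2, 4, 8}
submodels = [set(c)
             for r in range(1, len(model) + 1)
             for c in combinations(sorted(model), r)]

# Fact B.5b: the universal sentence holds in the model, hence in every submodel.
universal_preserved = is_chain(model) and all(is_chain(s) for s in submodels)

# Fact B.5a: whenever a submodel satisfies the existential sentence,
# so does its extension, the model.
existential_preserved = all(has_two_elements(model)
                            for s in submodels if has_two_elements(s))

print(universal_preserved, existential_preserved)
```

The converse directions fail in general: the one-element submodel {1} does not
satisfy the existential sentence even though the full model does.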

C Model Completeness 

This section presents the definition of model completeness and some basic results 
needed in Chapters 4 and 5. 

There are several ways to show that a theory is complete. We use only one, 
which is based on A. Robinson's notion of model completeness. 

Definition C.1. T is model complete iff T is consistent and for any two
models, A and B of T, A ⊆ B iff A is an elementary submodel of B. [Monk, p.
355]

A model complete theory is not necessarily complete, unless the theory has
a prime model.

Definition C.2. A is a prime model of T if A ⊨ T and A can be embedded
in any model of T. [Monk, p. 359]

Fact C.3. If T is model complete and T has a prime model, then T is complete.

To show that a theory is model complete we rely, directly or indirectly, on a 
theorem of Monk's (see 4.2.1a), which is based on some equivalent formulations 
of model completeness: 

Definition C.4. 

a. The A-expansion of L is the result of adding to L a constant for each
element of A.

b. The diagram of A is the set of atomic sentences and negations of atomic 
sentences of the A-expansion of the language of A which are true in A. 



Fact C.5. The following are equivalent [Monk, p. 356]:

a. T is model complete.

b. For every model, A, of T and every A-expansion L' of L, the T ∪ L'-diagram
of A is complete.

c. If A and B are models of T, A ⊆ B, φ is a universal formula, x ∈ A, and
A ⊨ φ(x), then B ⊨ φ(x).

d. If A and B are models of T, A ⊆ B, φ is a primitive formula, x ∈ A, and
B ⊨ φ(x), then A ⊨ φ(x).